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PLUS END-DIRECTED MICROTUBULE MOTOR REQUIRED FOR 

CHROMOSOME CONGRESSION 

CROSS REFERENCE TO RELATED APPLICATIONS 
This application claims the benefit of USSN 60/058,645, filed September 
11, 1997, herein incorporated by reference. 

FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 
This invention was made with Government support under Grant Nos. 
GM35252 and GM 29513, awarded by the National Institutes of Health. The 
Government has certain rights in this invention. 



FIELD OF THE INVENTION 
The invention provides isolated nucleic acid and amino acid sequences of 
Xenopus CENP-E (XCENP-E), antibodies to XCENP-E, methods of screening for 
CENP-E modulators using biologically active CENP-E, and kits for screening for CENP- 
E modulators. 



BACKGROUND OF THE INVENTION 
Segregation of genetic material during mitosis is mediated by the 
microtubules of the mitotic spindle {see, e.g., Mcintosh, in Microtubules , pp. 413-434 
(Hyams & Lloyd, eds., 1994). During mitosis, chromosomes are dynamically attached to 
spindle microtubules via the kinetochore, which is a structure located at the centromere 
of the chromosome. Kinetochores are involved in coordinating chromosome movement 
via microtubule assembly and disassembly. The kinetochore and its component proteins 
thus play an important role in the dynamics of mitosis. 

Spindle microtubules have a defined polarity, with their slow-growing, 
"minus" ends anchored at or near the spindle pole, and their dynamic, fast-growing 
"plus" ends interacting with chromosomes (Mcintosh, et al., J. Cell Biol. 98:525-533 
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(1984)). During prometaphase, chromosomes establish interactions with the fast-growing 
plus ends of microtubules via the kinetochore. Chromosomes then undergo a series of 
microtuouie-flepenaent movemenis, culminating m augumbiu. <u mw 
equidistant from the two spindle poles. This process is called "congression." However, 
the molecular mechanisms underlying chromosome congression are poorly understood 
(see, e.g., Rieder, et al, J. Cell Biol. 124:223-33 (1994)). A major question has been 
whether any kinetochore-associated microtubule motors play an important role in 
congression. 

The two predominant and opposing forces are currently thought to be 
responsible for chromosome movement during congression: (1) an anti-poleward polar 
force associated with regions of high microtubule density near the spindle poles, and (2) a 
poleward force generated at the kinetochore (Khodjakov, et al., J. Cell Biol. 135:315-327 
(1996); Waters, et al, J. Cell Sci. 109:2823-2831 (1996); reviewed in Rieder, et al., Int. 
Rev. Cytol. 79:1-57 (1982); Mitchison, et al., Annu. Rev. Cell Biol. 4:527-49 (1988); 
Rieder, et al., J. Cell Biol. 124:223-33 (1994)). 

Studies in vitro have demonstrated the presence of both plus and minus 
end-directed microtubule motor activities on kinetochores that may be responsible for 
these chromosome movements (Mitchison, et al., J. Cell Biol. 101:766-77 (1985); 
Hyman, et al., Nature 351:206-211 (1991)). The outstanding issue, however, has been 
the identity of the molecules at the kinetochore which act as motors and generate the 
force for chromosome movement. 

In general, both genetic and biochemical approaches have demonstrated 
crucial roles for microtubule motors in spindle assembly, spindle pole separation, and 
regulation of spindle microtubule dynamics. These motors include Eg5, CHOl/MKlpl, 
ncd, cut7, bimC, CIN8, KIP1, KAR3, Xklp2, XKCM1, and XCTK2 (Sawin, et al, 
Nature 359:540-543 (1992); Blangy, et al., Cell 83:1159-1169 (1995); Sawin, et al, J. 
Cell Biol. 112:925-940 (1991); Nislow, et al, J. Cell Biol. 111:511-522 (1990); Endow, 
et al, J. Cell Sci. 107:859-867 (1994); Hagan, et al, Nature 347:563-566 (1990); 
Hagan, et al, Nature 356:74-76 (1992); Enos, et al, Cell 60:1019-1027 (1990); Hoyt, et 
al, J. Cell Biol. 118:109-120; Roof, et al, J. Cell Biol. 118:95-108 (1992); Saunders, et 
al, Cell 70:451-458 (1992), Boleti, et al, J. Cell Biol. 125:1303-1312; Walczak, et al, 
Cell 84:37-47 (1996); Walczak, et al, J. Cell Biol. 136:859-70 (1997)). Two kinesin 
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superfamily members, Xenopus Xklpl and Drosophila nod localize to chromosome arms. 
With the exception of these two chromatin-associated motors, which are thought to 
mediate polar ejection forces, none of these other proteins have been implicated directly 
in congression or in chromosome movement during other phases of mitosis (Theurkauf, 
5 etaL, J. Cell Biol, 116:1167-1180 (1992); Afshar, et aL, Cell 81:129, Ce// 81:128-138 
(1995); Vernos, etaL, Trends in Cell BioL 5:297-301 (1995)). 

A candidate for powering chromosome movement in mitosis is 
centromere-associated protein-E (CENP-E), a member of the kinesin superfamily of 
microtubule motor proteins. Human CENP-E has been cloned and is an integral 

10 component of the kinetochore (Yen, et aL, Nature 359:536-539 (1992); Yao, et al. r The 
microtubule motor CENP-E is an integral component of kinetochore corona fibers that 
link centromeres to spindle microtubules (manuscript)). CENP-E localizes to 
kinetochores throughout all phases of mitotic chromosome movement (early prometaphase 
through anaphase A) (Yen, et aL, Nature 359:536-539 (1992); Brown, et aL, 7. Cell 

15 BioL 125:1303-1312 (1994); Lombillo, etaL, 7. Cell BioL 128:107-115 (1995)). 

Previous efforts have suggested a role for CENP-E in mitosis. 
Microinjection of a monoclonal antibody directed against CENP-E into cultured human 
cells delays anaphase onset (Yen, et aL, EMBO 7. 10:1245-1254 (1991)). Anti-CENP-E 
antibody injection into maturing mouse oocytes induces arrest at the first reductional 

20 division of meiosis (Duesbery, et aL, Proc. NatL Acad. ScL USA (in press, 1997)). 

Antibodies against CENP-E block microtubule depolymerization-dependent minus end- 
directed movement of purified chromosomes in vitro (Lombillo, et aL, 7. Cell BioL 
128:107-115 (1995)). 

However, these experiments have not demonstrated the precise role of 

25 CENP-E in mitosis, nor have they shown the activity of CENP-E, in particular any motor 
activity. Recently, CENP-E was reported to be associated with minus end-directed 
microtubule motor activity, raising the possibility that CENP-E might be responsible for 
poleward kinetochore movements (Thrower, et aL, EMBO J. 14:918-926 (1995)). 
However, biologically active CENP-E has never been isolated, neither from naturally 

30 occurring nor recombinant sources. 
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SUMMARY OF THE INVENTION 
The present invention provides for the first time biologically active 

that TFIMP-F. i« a 

CfcNP-fc, ana surprisingly ueiiioiisuaic*, wuuaij w ^tv«>uua - — - 

motor that powers chromosome movement toward microtubule plus ends. Using 
immunodepletion and antibody addition to Xenopus egg extracts, the present invention 
further demonstrates that CENP-E plays an essential role in congression. The present 
invention also provides for the first time the nucleotide and amino acid sequence of 
isolated Xenopus CENP-E. 

In one aspect, the invention provides an isolated, biologically active 
CENP-E protein, wherein the CENP-E protein has the following properties: (i) at least 
one activity selected from the group consisting of plus end-directed microtubule motor 
activity, ATPase activity, and microtubule binding activity; and (ii) the ability to 
specifically bind to polyclonal antibodies generated against CENP-E. In one 
embodiment, the CENP-E protein has an average molecular weight of about 300-350 
kDa. 

In one embodiment, the CENP-E protein has an amino acid sequence 
having at least 34%, or alternatively at least 45%, or alternatively at least 55% sequence 
identity with a XCENP-E motor domain of SEQ ID NO:l. Alternatively, CENP-E has at 
least 60%, 65% or 70% sequence identity with a XCENP-E motor domain of SEQ ID 
NO:l. In an alternative embodiment, the CENP-E has 70%, or alternatively 75%, or 
alternatively 80%, or alternatively 85%, or alternatively 90% or alternatively 95% amino 
acid sequence identity to a Xenopus CENP-E core motor domain as measured using a 
sequence comparison algorithm. In an alternative embodiment, the CENP-E protein has 
an amino acid sequence of SEQ ID NO:l. 

In another embodiment provided herein, the CENP-E protein is encoded by 
a nucleic acid sequence having at least 70% sequence identity with SEQ ID NO:2. In 
another aspect of the present invention, the CENP-E protein is encoded by a nucleic acid 
which hybridizes under high stringency to a nucleic acid having a sequence 
complementary to that of SEQ ID NO:2. 

In one embodiment, the CENP-E protein is from a human. In alternative 
embodiments provided herein, the CENP-E protein is from fungus, insects, or plants. 
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In an alternative embodiment provided herein, the CENP-E protein 
specifically binds to antibodies generated against Xenopus CENP-E (XCENP-E). In this 
embodiment, the CENP-E protein has an amino acid sequence having greater than 70%, 
or alternatively 75% sequence identity with a XCENP-E motor domain of SEQ ID NO:l. 

5 In another embodiment, the CENP-E protein has an amino acid sequence of a XCENP-E 
motor domain of SEQ ID NO:l. 

In the embodiments wherein the CENP-E is biologically active as described 
herein, the amino acid sequence can have 74% or less sequence identity with the motor 
domain of SEQ ID NO:l. 

10 Also provided herein is an isolated nucleic acid sequence encoding a 

CENP-E gene product, said sequence encoding a protein having a core motor domain that 
has greater than 70% or alternatively 75% amino acid sequence identity to a Xenopus 
CENP-E (XCENP-E) core motor domain as measured using a sequence comparison 
algorithm, and specifically binding to antibodies raised against CENP-E. In one 

15 embodiment, the sequence has a nucleotide sequence of SEQ ID NO:2. The sequence 
comparison algorithm can be PILEUP. 

In another aspect of the invention, an antibody which specifically binds to 
CENP-E is provided. 

Also provided herein is a method for identifying a candidate agent as a 

20 compound which modulates CENP-E activity. The method comprising the steps of 
determining CENP-E activity in the presence of a candidate agent at a control 
concentration. The CENP-E activity is selected from the group consisting of plus 
end-directed microtubule motor activity, ATPase activity and microtubule binding 
activity. The method further comprises the steps of determining said CENP-E activity in 

25 the presence the candidate agent at a test concentration, wherein a change in activity 
between the test concentration and the control concentration of said candidate agent 
indicates the identification of a compound which modulates CENP-E activity. The 
method can further comprise the step of isolating biologically active CENP-E from a cell 
sample. 

30 The compound to be identified can be a lead therapeutic, bioagricultural 

compound or diagnostic. Preferably the compound is an antibody which specifically 
binds CENP-E. In one embodiment the method further comprises the step of modifying 
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the antibody to be a humanized antibody. In one embodiment, the method is performed 
in a plurality such that many candidate agents are screened simultaneously. 

ine invention aiso memoes kus iui sciccumg ivi inuuuuuuia ui ^m-^. 
The kit includes a container holding biologically active CENP-E and instructions for 
5 assaying for CENP-E activity, wherein the CENP-E activity is plus end-directed 
microtubule motor activity or ATPase activity. 

The invention also provides a method of producing a biologically active 
CENP-E polypeptide. The method includes the steps of transforming a cell with a vector 
comprising the nucleic acid sequence encoding the motor domain of CENP-E; expressing 
10 said nucleic acid to produce a gene product; purifying said gene product; and identifying 
ATPase activity or plus-end directed microtubule activity of said gene product. 

In another aspect of the invention, a method of moving microtubules in a 
plus ended direction is provided wherein microtubules are contacted with biologically 
active CENP-E. 

15 In one embodiment the CENP-E is provided in gene form to a cell 

comprising microtubules. 



BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1A-C: Identification of Xenopus CENP-E 

20 Figure 1A: Structural comparison of Xenopus and human CENP-E. 

Hatched regions represent regions predicted to form a-helical coiled coils (Lupas, et al } 
Science 252, 1162-1164 (1991)). Within the N-terminal globular domains of both 
hCENP-E and XCENP-E there is a domain of -324 amino acids corresponding to the 
kinesin like motor domain. Within these 324 amino acids XCENP-E and hCENP-E are 

25 74% identical. One cDNA clone encoded a protein with a 9 amino acid insertion relative 
to the other cDNAs isolated (see Example I and methods). The position of this insertion 
is marked by the arrowhead. XCENP-E contains a putative nuclear localization signal 
(NLS) at the C-terminal end of the rod domain not present in hCENP-E. 

Figure IB: XCENP-E fusion proteins used for polyclonal antibody 

30 production. 
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Figure 1C: Deduced amino acid sequence ofXenopus CENP-E. cDNA 
sequence was compiled from 6 overlapping cDNA clones. Residues identical in 
hCENP-E and XCENP-E are shaded. The boxed region at the amino-terminus of the 
sequence is that portion of XCENP-E containing the motor domain and used to assay 
5 motility in vitro. The boxed sequence at the C-terminus is that portion of XCENP-E 

designated as the tail. The underlined sequence NSREHSINA (SEQ ID NO:3) at position 
599 is the 9 amino acid relative insertion encoded by one of the cDNAs isolated (see 
Figure 1A). The putative NLS, RKKTK (SEQ ID NO:4), immediately adjacent to the 
boxed tail domain is underlined. 

10 

Figure 2A-B: XCENP-E is a Plus End-Directed Microtubule Motor 

Figure 2A: Expression of recombinant XCENP-E in E. coli. XCENP-E 
amino acid residues 1-473 of XCENP-E were fused at the C-terminus to a c-myc epitope 
followed by a hexahistidine tag, expressed in E. coli, and purified over Ni-NTA-agarose 

15 resin. Coomassie stain of XCENP-E fusion protein used for motility (lane 1), 

immunoblot of XCENP-E fusion protein probed with a-myc monoclonal antibody (lane 
2). Arrowheads indicate XCENP-E fusion protein. 

Figure 2B: XCENP-E Motility Assay. Microtubules marked near their 
minus ends with brightly fluorescent seeds were added with ATP to a flow chamber 

20 containing purified XCENP-E fusion protein tethered to the coverslip with a-myc 
monoclonal antibody. Gliding of microtubules was monitored by time-lapse digital 
fluorescence microscopy. Selected time points from one time lapse series, spaced 90 
seconds apart are presented. As reference points, the positions of the plus ends of 
microtubules numbered 1, 2, and 3 at the start of continual gliding are marked with solid 

25 white dots, and the position of a stationary microtubule end is marked with an 

arrowhead. The bright seed of microtubule 3 enters the plane of focus at 1.5 minutes, 
and glides 13.6 /iM downward with the bright seed leading over the following 3 minutes. 
Microtubule 2 moves continually during the first three minutes, after which point it 
detaches and reattaches further toward the bottom of the frame. Microtubule 1 glides 

30 minus-end leading throughout the entire time course. The average microtubule velocity 
of all microtubules was 5.1 ^m/min +. 1.7 (n=49). Of those, 33 microtubules were 
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unambiguously polarity marked, and all glided with their bright seeds leading. Scalebar 
is 5 fxm. 

DETAILED DESCRIPTION OF THE INVENTION 

I. Introduction 

The present invention provides for the first time biologically active CENP- 
E and demonstrates that CENP-E has a plus end-directed microtubule motor activity. 
Biologically active CENP-E was used to show that the kinesin-like motor domain of 
CENP-E powers chromosome movement toward microtubule plus ends. Finally, 
quantitative removal of Xenopus CENP-E ("XCENP-E") from Xenopus egg extracts 
normally capable of assembling mitotic spindles in vitro impairs congression of 
chromosomes to the metaphase plate. Together, these findings demonstrate that CENP-E 
plus-end directed microtubule motor activity in vivo is essential for congression during 
mitosis. 

Functionally, CENP-E is localized in the kinetochores of condensed 
chromosomes in mitotic cells and has a plus-end directed microtubule motor activity that 
is ATP dependent (see, e.g., Example II, where ATP or another nucleotide triphosphate 
is included in the motility assay for motor activity). This activity is responsible for 
chromosome movement during mitosis. Structurally, the full length nucleotide sequence 
of XCENP-E (SEQ ID NO: 2) encodes a protein of 2954 amino acids with a predicted 
molecular mass of 340 kDa (SEQ ID NO:l, Figure 1C). XCENP-E is a member of the 
kinesin superfamily of motor proteins as evidenced by the sequence of its motor domain. 
The predicted structure of XCENP-E consists of a 500 amino acid globular 
amino-terminal domain containing a kinesin-like microtubule motor domain linked to a 
globular tail domain by a region predicted to form a long, discontinuous o-helical coiled 
coil (Lupas, et al., Science 252, 1162-1164 (1991); Berger, et al., Proc. Natl. Acad. Sci. 
USA 92:8259-8263 (1995)) (Figure 1A). Within the core of the motor domain (residues 
1-324) XCENP-E and human CENP-E ("hCENP-E") share 74% identity (Moore, et al., 
Bioessays 18:207-219 (1996)). Outside the amino-terminal domain lie three additional 
regions which share greater than 25% identity with hCENP-E, but not with other 
kinesin-like proteins (Figure 1). CENP-E is found in Xenopus, mammalian cells, and is 
predicted to exist in some fungi and perhaps Drosophila. 



*• WO 99/13061 



PCT/IIS98/19231 



9 

The isolation of biologically active CENP-E for the first time provides a 
means for assaying for enhancers or inhibitors (i.e., modulators) of this essential mitotic 
protein. Biologically active CENP-E is useful for testing for enhancers or inhibitors 
using in vitro assays such as microtubule gliding assays (see, e.g., Example II) or 

5 ATPase assays (Kodama et aL, 7. Biochem. 99: 1465-1472 (1986); Stewart et aL t Proc. 
Nat'lAcad. Sci. USA 90: 5209-5213 (1993); Sakowicz etaL, Science 280:292-295 
(1998)). For example, inhibitors identified using biologically active CENP-E can be used 
therapeutically to treat diseases of proliferating cells, including, e.g., cancers, 
hyperplasias, restenosis, cardiac hypertrophy, immune disorders, and inflammation. 

10 CENP-E also provides a convenient diagnostic marker for dividing cells. Antibodies or 
other probes for CENP-E can be used in vitro to identify cells that are entering mitosis. 
Inhibitors of CENP-E can also be used in vitro to synchronize cells just prior to entry 
into mitosis for use in cell culture. 



15 II. Definitions 

As used herein, the following terms have the meanings ascribed to them 

unless specified otherwise. 

The terms "isolated" "purified" or "biologically pure" refer to material that 
is substantially or essentially free from components which normally accompany it as 

20 found in its native state. Purity and homogeneity are typically determined using 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 
performance liquid chromatography. A protein that is the predominant species present in 
a preparation is substantially purified. In particular, an isolated XCENP-E nucleic acid is 
separated from open reading frames which flank the XCENP-E gene and encode proteins 

25 other than XCENP-E. The term "purified" denotes that a nucleic acid or protein gives 
rise to essentially one band in an electrophoretic gel. Particularly, it means that the 
nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and most 
preferably at least 99% pure. 

The term "nucleic acid" refers to deoxy ribonucleotides or ribonucleotides 

30 and polymers thereof in either single- or double-stranded form. Unless specifically 
limited, the term encompasses nucleic acids containing known analogues of natural 
nucleotides which have similar binding properties as the reference nucleic acid and are 
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metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise 
indicated, a particular nucleic acid sequence also implicitly encompasses conservatively 
modified variants tnereot (e.g., degenerate union suumhuuuiu; <mu 
sequences and as well as the sequence explicitly indicated. Specifically, degenerate 
codon substitutions may be achieved by generating sequences in which the third position 
of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues (Batzer et al., Nucleic Acid Res. 79:5081 (1991); Ohtsuka et al., J. 
Biol. Chem. 260:2605-2608 (1985); Cassol et al., 1992; Rossolini et al., Mol. Cell. 
Probes 8:91-98 (1994)). The term nucleic acid is used interchangeably with gene, 
cDNA, and mRNA encoded by a gene. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an artificial chemical analogue of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino 
acid polymers. A CENP-E polypeptide comprises a polypeptide demonstrated to have at 
least ATPase activity or plus end-directed microtubule motor activity and that binds to an 
antibody generated against CENP-E. 

A "label" is a composition detectable by spectroscopic, photochemical, 
biochemical, immunochemical, or chemical means. For example, useful labels include 
32 P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an 
ELISA), biotin, dioxigenin, or haptens and proteins for which antisera or monoclonal 
antibodies are available (e.g., the peptide of SEQ ID NO:l can be made detectable, e.g., 
by incorporating a radio-label into the peptide, and used to detect antibodies specifically 
reactive with the peptide). 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence 
through one or more types of chemical bonds, usually through complementary base 
pairing, usually through hydrogen bond formation. As used herein, a probe may include 
natural (i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In 
addition, the bases in a probe may be joined by a linkage other than a phosphodiester 
bond, so long as it does not interfere with hybridization. Thus, for example, probes may 
be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather 
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than phosphodiester linkages. It will be understood by one of skill in the art that probes 
may bind target sequences lacking complete complementarity with the probe sequence 
depending upon the stringency of the hybridization conditions. The probes are preferably 
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
5 labeled such as with biotin to which a streptavidin complex may later bind. By assaying 
for the presence or absence of the probe, one can detect the presence or absence of the 
select sequence or subsequence. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, 
either covalently, through a linker, or through ionic, van der Waals or hydrogen bonds to 

10 a label such that the presence of the probe may be detected by detecting the presence of 
the label bound to the probe. 

"Amplification" primers are oligonucleotides comprising either natural or 
analogue nucleotides that can serve as the basis for the amplification of a select nucleic 
acid sequence. They include, e.g., polymerase chain reaction primers and ligase chain 

15 reaction oligonucleotides. 

The term "recombinant" when used with reference to a cell, or nucleic 
acid, or vector, indicates that the cell, or nucleic acid, or vector, has been modified by 
the introduction of a heterologous nucleic acid or the alteration of a native nucleic acid, 
or that the cell is derived from a cell so modified. Thus, for example, recombinant cells 

20 express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed or not 
expressed at all. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences 

25 that are the same or have a specified percentage of amino acid residues or nucleotides 
that are the same, when compared and aligned for maximum correspondence over a 
comparison window, as measured using one of the following sequence comparison 
algorithms or by manual alignment and visual inspection. This definition also refers to 
the complement of a test sequence, which has a designated percent sequence or 

30 subsequence complementarity when the test sequence has a designated or substantial 

identity to a reference sequence. For example, a designated amino acid percent identity 
of 70% refers to sequences or subsequences that have at least about 70% amino acid 
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identity when aligned for maximum correspondence over a comparison window as 
measured using one of the following sequence comparison algorithms or by manual 
alignment and visuai inspeciion. Picfetably, the percent identity exists ever a region of 
the sequence that is at least about 25 amino acids in length, more preferably over a 
region that is 50 or 100 amino acids in length. 

When percentage of sequence identity is used in reference to proteins or 
peptides, it is recognized that residue positions that are not identical often differ by 
conservative amino acid substitutions, where amino acids residues are substituted for 
other amino acid residues with similar chemical properties (e.g., charge or 
hydrophobicity) and therefore do not change the functional properties of the molecule. 
Where sequences differ in conservative substitutions, the percent sequence identity may 
be adjusted upwards to correct for the conservative nature of the substitution. Means for 
making this adjustment are well known to those of skill in the art. Typically this 
involves scoring a conservative substitution as a partial rather than a full mismatch, 
thereby increasing the percentage sequence identity. Thus, for example, where an 
identical amino acid is given a score of 1 and a non-conservative substitution is given a 
score of zero, a conservative substitution is given a score between zero and 1. The 
scoring of conservative substitutions is calculated according to, e.g., the algorithm of 
Meyers & Miller, Computer Applic. Biol. ScL 4:11-17 (1988) e.g., as implemented in the 
program PC/GENE (Intelligenetics, Mountain View, California, USA).. 

For sequence comparison, typically one sequence acts as a reference 
sequence, to which test sequences are compared. When using a sequence comparison 
algorithm, test and reference sequences are entered into a computer, subsequence 
coordinates are designated, if necessary, and sequence algorithm program parameters are 
designated. Default program parameters can be used, or alternative parameters can be 
designated. The sequence comparison algorithm then calculates the percent sequence 
identity for the test sequence(s) relative to the reference sequence, based on the 
designated or default program parameters. 

A "comparison window", as used herein, includes reference to a segment 
of any one of the number of contiguous positions selected from the group consisting of 
from 25 to 600, usually about 50 to about 200, more usually about 100 to about 150 in 
which a sequence may be compared to a reference sequence of the same number of 
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contiguous positions after the two sequences are optimally aligned. Methods of alignment 
of sequences for comparison are well-known in the art. Optimal alignment of sequences 
for comparison can be conducted, e.g., by the local homology algorithm of Smith & 
Waterman, Adv. Appl. Math, 2:482 (1981), by the homology alignment algorithm of 
5 Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method 
of Pearson & Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the 
Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, WI), or by manual alignment and visual inspection (see, e.g., Ausubel et al. y 
10 supra). 

One example of a useful algorithm is PILEUP. PILEUP creates a multiple 
sequence alignment from a group of related sequences using progressive, pairwise 
alignments. It can also plot a tree showing the clustering relationships used to create the 
alignment. PILEUP uses a simplification of the progressive alignment method of Feng & 

15 Doolittle, J. Mol. Evol. 35:351-360 (1987). The method used is similar to the method 

described by Higgins & Sharp, CABIOS 5: 151-153 (1989). The program can align up to 
300 sequences of a maximum length of 5,000. The multiple alignment procedure begins 
with the pairwise alignment of the two most similar sequences, producing a cluster of two 
aligned sequences. This cluster can then be aligned to the next most related sequence or 

20 cluster of aligned sequences. Two clusters of sequences can be aligned by a simple 

extension of the pairwise alignment of two individual sequences. The final alignment is 
achieved by a series of progressive, pairwise alignments. The program can also be used 
to plot a dendogram or tree representation of clustering relationships. The program is 
run by designating specific sequences and their amino acid or nucleotide coordinates for 

25 regions of sequence comparison, e.g., the core motor region of CENP-E. In one 

example, hCENP-E, XCENP-E and ustilago CENP-E were compared to other kinesin 
superfamily sequences using the following parameters: default gap weight (3.00), default 
gap length weight (0.10), and weighted end gaps. The resulting dendogram placed 
hCENP-E and XCENP-E in one cluster as the most closely related sequences, with 

30 ustilago CENP-E in the next most closely related cluster. 

Another example of algorithm that is suitable for determining percent 
sequence identity (i.e., substantial similarity or identity) is the BLAST algorithm, which 
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is described in Altschul et al, 7. MoL Biol. 215:403-410 (1990). Software for 
performing BLAST analyses is publicly available through the National Center for 
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identifying high scoring sequence pairs (HSPs) by identifying short words of length W in 
the query sequence, which either match or satisfy some positive-valued threshold score T 
when aligned with a word of the same length in a database sequence. T is referred to as 
the neighborhood word score threshold (Altschul et al, supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are then extended in both directions along each sequence for as far 
as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues, always < 0). For 
amino acid sequences, a scoring matrix is used to calculate the cumulative score. 
Extension of the word hits in each direction are halted when: the cumulative alignment 
score falls off by the quantity X from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or more negative-scoring residue 
alignments; or the end of either sequence is reached. The BLAST algorithm parameters 
W, T, and X determine the sensitivity and speed of the alignment. The BLASTN 
program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an 
expectation (E) of 10, M=5, N=4, and a comparison of both strands. For amino acid 
sequences, the BLASTP program uses as default parameters a wordlength (W) of 3, an 
expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, 
Proc. Natl. Acad. Sci. USA 89:10915 (1989)). 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat 'I. Acad. Sci. USA 
90:5873-5787 (1993)). One measure of similarity provided by the BLAST algorithm is 
the smallest sum probability (P(N)), which provides an indication of the probability by 
which a match between two nucleotide or amino acid sequences would occur by chance. 
For example, a nucleic acid is considered similar to a reference sequence if the smallest 
sum probability in a comparison of the test nucleic acid to the reference nucleic acid is 
less than about 0.1, more preferably less than about 0.01, and most preferably less than 
about 0.001. 
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An indication that two nucleic acid sequences or polypeptides are 
substantially identical is that the polypeptide encoded by the first nucleic acid is 
immunologically cross reactive with the antibodies raised against the polypeptide encoded 
by the second nucleic acid, as described below. Thus, a polypeptide is typically 
substantially identical to a second polypeptide, for example, where the two peptides differ 
only by conservative substitutions. Another indication that two nucleic acid sequences 
are substantially identical is that the two molecules or their complements hybridize to 
each other under stringent conditions, as described below. 

The phrase "hybridizing specifically to" refers to the binding, duplexing, 
or hybridizing of a molecule only to a particular nucleotide sequence under stringent 
conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA 
or RNA. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures. 
Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (T m ) for the specific sequence at a defined ionic strength and pH. The T m 
is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 
sequence at equilibrium (as the target sequences are generally present in excess, at T m , 
50% of the probes are occupied at equilibrium). Typically, stringent conditions will be 
those in which the salt concentration is less than about 1.0 M sodium ion, typically about 
0.01 to 1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the 
temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at 
least about 60°C for long probes (e.g., greater than 50 nucleotides). Stringent conditions 
may also be achieved with the addition of destabilizing agents such as formamide. 

"High stringency conditions", as defined herein, may be identified by 
those that: (1) employ low ionic strength and high temperature for washing, for example 
0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; 
(2) employ during hybridization a denaturing agent, such as formamide, for example, 
50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% 
polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM sodium 
chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 
M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium 



WO 99/13061 



PCT/WS98/19231 * 



16 

pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 /ig/ml), 0.1% 
SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium 
chloride/sodium citrate) and 50% formamide at 55 C C, foiiowed by a high-siringcucy 
wash consisting of 0.1 x SSC containing EDTA at 55°C. 

5 "Moderately stringent conditions" may be identified as described by 

Sambrook et al } Molecular Cloning: A Laboratory Manual, New York: Cold Spring 
Harbor Press, 1989, and include the use of washing solution and hybridization conditions 
(e.g., temperature, ionic strength and %SDS) less stringent that those described above. 
An example of moderately stringent conditions is overnight incubation at 37 °C in a 

10 solution comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium 
citrate), 50 mM sodium phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran 
sulfate, and 20 mg/mL denatured sheared salmon sperm DNA, followed by washing the 
filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust the 
temperature, ionic strength, etc. as necessary to accommodate factors such as probe 

15 length and the like." 

The phrase "a sequence encoding a gene product" refers to a nucleic acid 
that contains sequence information, e.g., for a structural RNA such as rRNA, a tRNA, 
the primary amino acid sequence of a specific protein or peptide, a binding site for a 
trans-acting regulatory agent, an antisense RNA or a ribozyme. This phrase specifically 

20 encompasses degenerate codons (i.e., different codons which encode a single amino acid) 
of the native sequence or sequences which may be introduced to conform with codon 
preference in a specific host cell. 

"Antibody" refers to a polypeptide substantially encoded by an 
immunoglobulin gene or immunoglobulin genes, or fragments thereof which specifically 

25 bind and recognize an analyte (antigen). The recognized immunoglobulin genes include 
the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well 
as the myriad immunoglobulin variable region genes. Light chains are classified as either 
kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, 
which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, 

30 respectively. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each 
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pair having one "light" (about 25 kDa) and one "heavy" chain (about 50-70 kDa). The 
N-terminus of each chain defines a variable region of about 100 to 110 or more amino 
acids primarily responsible for antigen recognition. The terms variable light chain (VJ 
and variable heavy chain (V H ) refer to these light and heavy chains respectively. 
5 Antibodies exist e.g., as intact immunoglobulins or as a number of well 

characterized fragments produced by digestion with various peptidases. Thus, for 
example, pepsin digests an antibody below the disulfide linkages in the hinge region to 
produce F(ab)' 2i a dimer of Fab which itself is a light chain joined to V H -C H 1 by a 
disulfide bond. The F(ab)' 2 may be reduced under mild conditions to break the disulfide 
10 linkage in the hinge region, thereby converting the F(ab)' 2 dimer into an Fab' monomer. 
The Fab' monomer is essentially an Fab with part of the hinge region {see Fundamental 
Immunology (Paul, ed., 3d ed. 1993). While various antibody fragments are defined in 
terms of the digestion of an intact antibody, one of skill will appreciate that such 
fragments may be synthesized de novo either chemically or by using recombinant DNA 
15 methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies or those synthesized de novo 
using recombinant DNA methodologies (e.g., single chain Fv). 

An "anti-XCENP-E" antibody is an antibody or antibody fragment that 
specifically binds a polypeptide encoded by the XCENP-E gene, cDNA, or a 
20 subsequence thereof. 

Humanized forms of non-human antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other 
antigen- binding subsequences of antibodies) which contain minimal sequence derived 
from non-human immunoglobulin. Humanized antibodies include human 
25 immunoglobulins (recipient antibody) in which residues from a complementary 

determining region (CDR) of the recipient are replaced by residues from a CDR of a 
non-human species (donor antibody) such as mouse, rat or rabbit having the desired 
specificity, affinity and capacity. In some instances, Fv framework residues of the 
human immunoglobulin are replaced by corresponding non-human residues. Humanized 
30 antibodies may also comprise residues which are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody 
will comprise substantially all of at least one, and typically two, variable domains, in 
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which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the FR regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will 
comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a 

5 human immunoglobulin (Jones et al, Nature, 321:522-525 (1986); Riechmann et al, 
Nature, 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol, 2:593-596 (1992)). 

Methods for humanizing non-human antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into it 
from a source which is non-human. These non-human amino acid residues are often 

10 referred to as "import" residues, which are typically taken from an "import" variable 

domain. Humanization can be essentially performed following the method of Winter and 
co-workers (Jones et al, Nature, 321:522-525 (1986); Riechmann et al, Nature, 
332:323-327 (1988); Verhoeyen etal, Science, 239:1534-1536 (1988)), by substituting 
rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. 

15 Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent No. 
4,816,567), wherein substantially less than an intact human variable domain has been 
substituted by the corresponding sequence from a non-human species. In practice, 
humanized antibodies are typically human antibodies in which some CDR residues and 
possibly some FR residues are substituted by residues from analogous sites in rodent 

20 antibodies. 

Human antibodies can also be produced using various techniques known in 
the art, including phage display libraries (Hoogenboom & Winter, /. Mol Biol, 221 'Ml 
(1991); Marks et al, 7. Mol Biol, 222:581 (1991)). The techniques of Cole et al. and 
Boerner et al. are also available for the preparation of human monoclonal antibodies 

25 (Cole et al, Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and 
Boerner et al, J. Immunol, 147(l):86-95 (1991)). Similarly, human antibodies can be 
made by introducing of human immunoglobulin loci into transgenic animals, e.g., mice in 
which the endogenous immunoglobulin genes have been partially or completely 
inactivated. Upon challenge, human antibody production is observed, which closely 

30 resembles that seen in humans in all respects, including gene rearrangement, assembly, 
and antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 
5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following 
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scientific publications: Marks et aL, Bio/Technology 10, 779-783 (1992); Lonberg et aL, 
Nature 368 856-859 (1994); Morrison, Nature 368, 812-13 (1994); Fishwild et aL, 
Nature Biotechnology 14, 845-51 (1996); Neuberger, Nature Biotechnology 14, 826 
(1996); Lonberg & Huszar, Intern. Rev. Immunol. 13 65-93 (1995). 

5 A "chimeric antibody" is an antibody molecule in which (a) the constant 

region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding 
site (variable region) is linked to a constant region of a different or altered class, effector 
function and/or species, or an entirely different molecule which confers new properties to 
the chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) 

10 the variable region, or a portion thereof, is altered, replaced or exchanged with a variable 
region having a different or altered antigen specificity. 

The term "immunoassay" is an assay that uses an antibody to specifically 
bind an analyte. The immunoassay is characterized by the use of specific binding 
properties of a particular antibody to isolate, target, and/or quantify the analyte. 

15 The phrase "specifically (or selectively) binds to an antibody" or 

"specifically (or selectively) immunoreactive with," when referring to a protein or 
peptide, refers to a binding reaction that is determinative of the presence of the protein in 
a heterogeneous population of proteins and other biologies. Thus, under designated 
immunoassay conditions, the specified antibodies bind to a particular protein and do not 

20 bind in a significant amount to other proteins present in the sample. Specific binding to 
an antibody under such conditions may require an antibody that is selected for its 
specificity for a particular protein. For example, antibodies raised to XCENP-E with the 
amino acid sequence encoded in SEQ ID NO:l can be selected to obtain antibodies 
specifically immunoreactive with that protein and not with other proteins, except for 

25 polymorphic variants. A variety of immunoassay formats may be used to select 

antibodies specifically immunoreactive with a particular protein. For example, solid- 
phase ELISA immunoassays are routinely used to select antibodies specifically 
immunoreactive with a protein (see, e.g., Harlow & Lane, Antibodies, A Laboratory 
Manual (1988), for a description of immunoassay formats and conditions that can be used 

30 to determine specific immunoreactivity). Typically a specific or selective reaction will be 
at least twice background signal or noise and more typically more than 10 to 100 times 
background. 
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The phrase "plus end-directed microtubule motor activity" refers to the 
activity of a motor protein such as CENP-E to power movement toward the "plus" ends 
of microtubules. Microtubules are conventionally refeneu to as having plus (fast 
growing) and minus ends (slow growing). For example, microtubules of the mitotic 

5 spindle have their slow growing, minus ends anchored at or near the spindle pole, and 
their dynamic, fast growing plus ends interacting with chromosomes and with 
microtubules emanating from the opposite pole. 

The term "motor domain" or "core motor domain" refers to the domain of 
CENP-E that confers the plus end-microtubule motor activity on the protein. 

10 "CENP-E" refers to centromere-associated protein, which is a member of 

the kinesin superfamily of microtubule motor proteins. CENP-E is an integral component 
of the kinetochore structure of the chromosome, which links the chromosome to the 
spindle microtubules. "XCENP-E" refers to CENP-E isolated from Xenopus. CENP-E 
has activity such as ATPase activity, microtubule binding activity, and plus end-directed 

15 microtubule motor activity. 

"Modulators of CENP-E" refers to modulatory molecules identified using 
an in vitro assays for CENP-E activity (e.g., inhibitors and activators or enhancers). 
Such assays include ATPase activity, microtubule gliding, spindle assembly, microtubule 
depolymerizing activity, and metaphase arrest. Samples or assays that are treated with a 

20 at least one candidate agent at a test concentration are compared to control samples 

having the candidate agent at a control concentration (which can be zero), to examine the 
extent of modulation. Control samples are assigned a relative CENP-E activity value of 
100. Modulation of CENP-E is achieved when the CENP-E activity value relative to the 
control is increased or decreased about at least 10%, 20%, 30%, 40%, 50%, 75%, or 

25 preferably, at least 100%. 

"Biologically active" CENP-E refers to CENP-E that has at least one 
activity selected ATPase activity, microtubule binding activity, and plus end-directed 
microtubule motor activity, as tested in an ATPase assay, microtubule binding assay, or a 
microtubule gliding assay. "ATPase activity" refers to the ability of CENP-E to 

30 hydrolyze ATP. In a preferred embodiment, CENP-E has plus-end directed microtubule 
activity. 
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ffl. Isolation of the XCENP-E gene 

A. General Recombinant DNA Methods 

This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et al., Molecular Cloning, A Laboratory Manual (2nd ed. 1989); Kriegler, 
Gene Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in 
Molecular Biology (Ausubel et al., eds., 1994)). 

For nucleic acids, sizes are given in either kilobases (kb) or base pairs 
(bp). These are estimates derived from agarose or acrylamide gel electrophoresis, from 
sequenced nucleic acids, or from published DNA sequences. For proteins, sizes are 
given in kilodaltons (kDa) or amino acid residue numbers. Proteins sizes are estimated 
from gel electrophoresis, from sequenced proteins, from derived amino acid sequences, 
or from published protein sequences. 

Oligonucleotides that are not commercially available can be chemically 
synthesized according to the solid phase phosphoramidite triester method first described 
by Beaucage & Caruthers, Tetrahedron Letts., 22:1859-1862 (1981), using an automated 
synthesizer, as described in Van Devanter et. al, Nucleic Acids Res., 12:6159-6168 
(1984). Purification of oligonucleotides is by either native acrylamide gel electrophoresis 
or by anion-exchange HPLC as described in Pearson & Reanier, 7. Chrom., 255:137-149 
(1983). 

The sequence of the cloned genes and synthetic oligonucleotides can be 
verified after cloning using, e.g., the chain termination method for sequencing 
double-stranded templates of Wallace et al., Gene, 16:21-26, 1981. 

B. Cloning methods for the isolation of nucleotide sequences encoding XCENP-E 

In general, the nucleic acid sequences encoding XCENP-E and related 
nucleic acid sequence homologues are cloned from cDNA and genomic DNA libraries or 
isolated using amplification techniques with oligonucleotide primers. For example, 
XCENP-E sequences can be isolated from Xenopus DNA libraries by hybridizing with a 
nucleic acid probe, the sequence of which can be derived from human CENP-E. 
XCENP-E and XCENP-E homologues that are substantially identical to XCENP-E can be 
isolated using XCENP-E nucleic acid probes and oligonucleotides under stringent 
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hybridization conditions, by screening libraries. Alternatively, expression libraries can 
be used to clone XCENP-E and XCENP-E homologues, by detecting expressed 
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that also recognize and selectively bind to the XCENP-E homologue. Finally, 

5 amplification techniques using primers can be used to amplify and isolate XCENP-E from 
DNA or RNA. Amplification techniques using degenerate primers can also be used to 
amplify and isolate XCENP-E homologues. Amplification techniques using primers can 
also be used to isolate a nucleic acid encoding XCENP-E. These primers can be used, 
e.g., to amplify a probe of several hundred nucleotides, which is then used to screen a 

10 library for full-length XCENP-E. The following primers can be used in such a manner: 
SEQ ID NO: 5 GGGCTGCCCAGGAAGAG and SEQ ID NO:6 
GACAGCATTGATCGGCG. Alternatively, the nucleic acid for XCENP-E can be 
directly amplified using the following primers: SEQ ID NO:7 
GAGGGTTCGGCCGCTTA and SEQ ID NO:8 TCTGGGGCCATCCATGC. 

15 Appropriate primers and probes for identifying the gene encoding CENP-E 

in other species such as Drosophila and fungi are generated from comparisons of the 
sequences provided herein (SEQ ID NOS:l and 2). As described above, antibodies can 
be used to identify XCENP-E homologues. For example, antibodies made to the motor 
domain of XCENP-E, the tail domain of XCENP-E, or to the whole protein are useful 

20 for identifying XCENP-E homologues (see Example section, below). 

To make a cDNA library, one should choose a source that is rich in the 
mRNA of choice, e.g., XCENP-E. For example, ovary tissue is enriched for XCENP-E 
mRNA. The mRNA is then made into cDNA using reverse transcriptase, ligated into a 
recombinant vector, and transfected into a recombinant host for propagation, screening 

25 and cloning. Methods for making and screening cDNA libraries are well known {see, 
e.g., Gubler & Hoffman, Gene 25: 263-269 (1983); Sambrook et al, supra; Ausubel et 
ai, supra). 

For a genomic library, the DNA is extracted from the tissue and either 
mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb. 
30 The fragments are then separated by gradient centrifugation from undesired sizes and are 
constructed in bacteriophage lambda vectors. These vectors and phage are packaged in 
vitro. Recombinant phage are analyzed by plaque hybridization as described in Benton & 
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Davis, Science 196:180-182 (1977). Colony hybridization is carried out as generally 
described in Grunstein et al., Proc. Natl. Acad. Sci. USA., 72:3961-3965 (1975). 

An alternative method of isolating XCENP-E nucleic acid and its 
homologues combines the use of synthetic oligonucleotide primers and amplification of an 

5 RNA or DNA template (see U.S. Patents 4,683,195 and 4,683,202; PCR Protocols: A 
Guide to Methods and Applications (Innis et al., eds, 1990)). Methods such as 
polymerase chain reaction (PCR) and ligase chain reaction (LCR) can be used to amplify 
nucleic acid sequences of CENP-E directly from mRNA, from cDNA, from genomic 
libraries or cDNA libraries. Degenerate oligonucleotides can be designed to amplify 

10 XCENP-E homologues using the sequences provided herein. Restriction endonuclease 
sites can be incorporated into the primers. Polymerase chain reaction or other in vitro 
amplification methods may also be useful, for example, to clone nucleic acid sequences 
that code for proteins to be expressed, to make nucleic acids to use as probes for 
detecting the presence of CENP-E encoding mRNA in physiological samples, for nucleic 

15 acid sequencing, or for other purposes. Genes amplified by the PCR reaction can be 
purified from agarose gels and cloned into an appropriate vector. 

Gene expression of CENP-E can also be analyzed by techniques known in 
the art, e.g., reverse transcription and amplification of mRNA, isolation of total RNA or 
poly A+ RNA, northern blotting, dot blotting, in situ hybridization, RNase protection 

20 and the like. 

Synthetic oligonucleotides can be used to construct recombinant XCENP-E 
genes for use as probes or for expression of protein. This method is performed using a 
series of overlapping oligonucleotides usually 40-120 bp in length, representing both the 
sense and nonsense strands of the gene. These DNA fragments are then annealed, ligated 

25 and cloned. Alternatively, amplification techniques can be used with precise primers to 
amplify a specific subsequence of the XCENP-E gene. The specific subsequence is then 
ligated into an expression vector. 

The gene for Xenopus CENP-E is typically cloned into intermediate vectors 
before transformation into prokaryotic or eukaryotic cells for replication and/or 

30 expression. These intermediate vectors are typically prokaryote vectors or shuttle 
vectors. 
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C. Expression in prokaryotes and eukaryotes 

To obtain high level expression of a cloned gene, such as those cDNAs 

minimum, a strong promoter to direct transcription, a ribosome binding site for 

5 translation^ initiation, and a transcription/translation terminator. Suitable bacterial 

promoters are well known in the art and described, e.g., in Sambrook et al and Ausubel 
et al Bacterial expression systems for expressing the CENP-E protein are available in, 
e.g., E. coll Bacillus sp. f and Salmonella (Palva et al, Gene 22:229-235 (1983); 
Mosbach, et al, Nature, 302:543-545 (1983). Kits for such expression systems are 

10 commercially available. Eukaryotic expression systems for mammalian cells, yeast, and 
insect cells are well known in the art and are also commercially available. The pET23D 
expression system (Novagen) is a preferred prokaryotic expression system. 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary 

15 nucleic acid sequences near the start site of transcription, such as, in the case of a 
polymerase II type promoter, a TATA element. A promoter also optionally includes 
distal enhancer or repressor elements which can be located as much as several thousand 
base pairs from the start site of transcription. A "constitutive" promoter is a promoter 
which is active under most environmental and developmental conditions. An "inducible" 

20 promoter is a promoter which is under environmental or developmental regulation. The 
term "operably linked" refers to a functional linkage between a nucleic acid expression 
control sequence (such as a promoter, or array of transcription factor binding sites) and a 
second nucleic acid sequence, wherein the expression control sequence directs 
transcription of the nucleic acid corresponding to the second sequence. 

25 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences which are not 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences from unrelated genes 
arranged to make a new functional nucleic acid. 

30 The promoter used to direct expression of a heterologous nucleic acid 

depends on the particular application. The promoter is preferably positioned about the 
same distance from the heterologous transcription start site as it is from the transcription 
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start site in its natural setting. As is known in the art, however, some variation in this 
distance can be accommodated without loss of promoter function. 

In addition to the promoter, the expression vector typically contains a 
transcription unit or expression cassette that contains all the additional elements required 

5 for the expression of the CENP-E encoding DNA in host cells. A typical expression 

cassette thus contains a promoter operably linked to the DNA sequence encoding CENP- 
E and signals required for efficient polyadenylation of the transcript, ribosome binding 
sites, and translation termination. The DNA sequence encoding the CENP-E may 
typically be linked to a cleavable signal peptide sequence to promote secretion of the 

10 encoded protein by the transformed cell. Such signal peptides would include, among 

others, the signal peptides from tissue plasminogen activator, insulin, and neuron growth 
factor, and juvenile hormone esterase of Heliothis virescens. Additional elements of the 
cassette may include enhancers and, if genomic DNA is used as the structural gene, 
introns with functional splice donor and acceptor sites. 

15 In addition to a promoter sequence, the expression cassette should also 

contain a transcription termination region downstream of the structural gene to provide 
for efficient termination. The termination region may be obtained from the same gene as 
the promoter sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information 

20 into the cell is not particularly critical. Any of the conventional vectors used for 

expression in eukaryotic or prokaryotic cells may be used. Standard bacterial expression 
vectors include plasmids such as pBR322 based plasmids, pSKF, and fusion expression 
systems such as GST and LacZ. Epitope tags can also be added to recombinant proteins 
to provide convenient methods of isolation. One preferred embodiment of an epitope tag 

25 is c-myc. 

Expression vectors containing regulatory elements from eukaryotic viruses 
are typically used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus 
vectors, and vectors derived from Epstein Bar virus. Other exemplary eukaryotic vectors 
include pMSG, pAV009/A + , pMTO10/A + , pMAMneo-5, baculovirus pDSVE, and any 
30 other vector allowing expression of proteins under the direction of the SV40 early 

promoter, SV40 later promoter, metallothionein promoter, murine mammary tumor virus 
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promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters shown 
effective for expression in eukaryotic cells. 

• * t 1 ~~ nana omnlifiPOtiATl 
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such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate reductase. 
Alternatively, high yield expression systems not involving gene amplification are also 
suitable, such as using a bacculovirus vector in insect cells, with a CENP-E encoding 
sequence under the direction of the polyhedrin promoter or other strong baculovirus 
promoters. 

The elements that are typically included in expression vectors also include 
a replicon that functions in E. coli, a gene encoding antibiotic resistance to permit 
selection of bacteria that harbor recombinant plasmids, and unique restriction sites in 
nonessential regions of the plasmid to allow insertion of eukaryotic sequences. The 
particular antibiotic resistance gene chosen is not critical, any of the many resistance 
genes known in the art are suitable. The prokaryotic sequences are preferably chosen 
such that they do not interfere with the replication of the DNA in eukaryotic cells, if 
necessary. 

Standard transfection methods are used to produce bacterial, mammalian, 
yeast or insect cell lines that express large quantities of CENP-E protein, which are then 
purified using standard techniques (see, e.g., Colley etal., J. Biol. Chem. 
264:17619-17622 (1989); Guide to Protein Purification, in Methods in Enzymology, vol. 
182 (Deutscher ed., 1990)). Transformation of eukaryotic and prokaryotic cells are 
performed according to standard techniques (see, e.g., Morrison, J. Bact., 132:349-351 
(1977); Clark-Curtiss & Curtiss, Methods in Enzymology, 101:347-362 (Wu et ai, eds, 
1983). 

Any of the well known procedures for introducing foreign nucleotide 
sequences into host cells may be used. These include the use of calcium phosphate 
transfection, polybrene, protoplast fusion, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing 
cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a 
host cell (see, e.g., Sambrook et al, supra). It is only necessary that the particular 
genetic engineering procedure used be capable of successfully introducing at least one 
gene into the host cell capable of expressing the CENP-E protein. 
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After the expression vector is introduced into the cells, the transfected cells 
are cultured under conditions favoring expression of the CENP-E protein which is 
recovered from the culture using standard techniques identified below. 



5 IV. Purification of CENP-E protein 

Either naturally occurring or recombinant CENP-E can be purified for use 
in functional assays. Naturally occurring CENP-E is purified, e.g., from Xenopus and 
any other source of an XCENP-E homologue, such as Drosophila or fungi. Recombinant 
CENP-E is purified from any suitable expression system. 

10 CENP-E may be purified to substantial purity by standard techniques, 

including selective precipitation with such substances as ammonium sulfate; column 
chromatography, immunopurification methods, and others (see, e.g., Scopes, Protein 
Purification: Principles and Practice (1982); U.S. Patent No. 4,673,641; Ausubel et aL, 
supra; and Sambrook et aU y supra). A preferred method of purification is use of Ni- 

15 NTA agarose (Qiagen). 

A number of procedures can be employed when recombinant CENP-E is 
being purified. For example, proteins having established molecular adhesion properties 
can be reversible fused to CENP-E. With the appropriate Iigand, CENP-E can be 
selectively adsorbed to a purification column and then freed from the column in a 

20 relatively pure form. The fused protein is then removed by enzymatic activity. Finally 
CENP-E could be purified using immunoaffinity columns. 



A Purification of CENP-E from recombinant bacteria 

Recombinant proteins are expressed by transformed bacteria in large 

25 amounts, typically after promoter induction; but expression can be constitutive. Bacteria 
are grown according to standard procedures in the art. Because CENP-E is a protein that 
is difficult to isolate with intact biological activity, preferably fresh bacteria cells are used 
for isolation of protein. Use of cells that are frozen after growth but prior to lysis 
typically results in negligible yields of active protein. 

30 Proteins expressed in bacteria may form insoluble aggregates ("inclusion 

bodies"). Several protocols are suitable for purification of CENP-E inclusion bodies. 
For example, purification of inclusion bodies typically involves the extraction, separation 
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and/or purification of inclusion bodies by disruption of bacterial cells, e.g., by incubation 
in a buffer of about 100-150 /ig/ml lysozyme and 0.1% Nonidet P40, a non-ionic 
detergent. The ceii suspension can be homogenized using a Polytron (Bnnkman 
Instruments, Westbury, N.Y.). Alternatively, the cells can be sonicated on ice. 
Alternate methods of lysing bacteria are apparent to those of skill in the art (see, e.g., 
Sambrook et ai, supra; Ausubel et al, supra). 

The cell suspension is generally centrifuged and the pellet containing the 
inclusion bodies resuspended in buffer that does not dissolve but washes the inclusion 
bodies, e.g., 20 mM Tris-HCl (pH 7.2), 1 mM EDTA, 150 mM NaCl and 2% Triton-X 
100, a non-ionic detergent. It may be necessary to repeat the wash step to remove as 
much cellular debris as possible. The remaining pellet of inclusion bodies may be 
resuspended in an appropriate buffer (e.g., 20 mM sodium phosphate, pH 6.8, 150 mM 
NaCl). Other appropriate buffers will be apparent to those of skill in the an. 

Following the washing step, the inclusion bodies are solubilized by the 
addition of a solvent that is both a strong hydrogen acceptor and a strong hydrogen donor 
(or a combination of solvents each having one of these properties); the proteins that 
formed the inclusion bodies may then be renatured by dilution or dialysis with a 
compatible buffer. Suitable solvents include, but are not limited to urea (from about 4 M 
to about 8 M), formamide (at least about 80%, volume/volume basis), and guanidine 
hydrochloride (from about 4 M to about 8 M). Some solvents which are capable of 
solubilizing aggregate-forming proteins, for example SDS (sodium dodecyl sulfate), 70% 
formic acid, are inappropriate for use in this procedure due to the possibility of 
irreversible denaturation of the proteins, accompanied by a lack of immunogenicity 
and/or activity. Although guanidine hydrochloride and similar agents are denaturants, 
this denaturation is not irreversible and renaturation may occur upon removal (by 
dialysis, for example) or dilution of the denaturant, allowing re-formation of 
immunologically and/or biologically active protein. After solubilization, the protein can 
be separated from other bacterial proteins by standard separation techniques. 

Alternatively, it is possible to purify CENP-E from bacteria periplasm. 
Where CENP-E is exported into the periplasm of the bacteria, the periplasmic fraction of 
the bacteria can be isolated by cold osmotic shock in addition to other methods known to 
skill in the art. To isolate recombinant proteins from the periplasm, the bacterial cells 
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are centrifuged to form a pellet. The pellet is resuspended in a buffer containing 20% 
sucrose. To lyse the cells, the bacteria are centrifuged and the pellet is resuspended in 
ice-cold 5 mM MgS0 4 and kept in an ice bath for approximately 10 minutes. The cell 
suspension is centrifuged and the supernatant decanted and saved. The recombinant 
5 proteins present in the supernatant can be separated from the host proteins by standard 
separation techniques well known to those of skill in the art. 

B. Standard Protein Separation Techniques For Purifying CENP-E 
Solubility Fractionation 

10 Often as an initial step, particularly if the protein mixture is complex, an 

initial salt fractionation can separate many of the unwanted host cell proteins (or proteins 
derived from the cell culture media) from the recombinant protein of interest. The 
preferred salt is ammonium sulfate. Ammonium sulfate precipitates proteins by 
effectively reducing the amount of water in the protein mixture. Proteins then precipitate 

15 on the basis of their solubility. The more hydrophobic a protein is, the more likely it is 
to precipitate at lower ammonium sulfate concentrations. A typical protocol includes 
adding saturated ammonium sulfate to a protein solution so that the resultant ammonium 
sulfate concentration is between 20-30%. This concentration will precipitate the most 
hydrophobic of proteins. The precipitate is then discarded (unless the protein of interest 

20 is hydrophobic) and ammonium sulfate is added to the supernatant to a concentration 

known to precipitate the protein of interest. The precipitate is then solubilized in buffer 
and the excess salt removed if necessary, either through dialysis or diafiltration. Other 
methods that rely on solubility of proteins, such as cold ethanol precipitation, are well 
known to those of skill in the art and can be used to fractionate complex protein 

25 mixtures. 

Size Differential Filtration 

CENP-E has a known molecular weight and this knowledge can be used to 
isolated it from proteins of greater and lesser size using ultrafiltration through membranes 
30 of different pore size (for example, Amicon or Millipore membranes). As a first step, 
the protein mixture is ultrafiltered through a membrane with a pore size that has a lower 
molecular weight cut-off than the molecular weight of the protein of interest. The 
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retentate of the ultrafiltration is then ultrafiltered against a membrane with a molecular 
cut off greater than the molecular weight of the protein of interest. The recombinant 
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chromatographed as described below. 



Column Chromatography 

CENP-E can also be separated from other proteins on the basis of its size, 
net surface charge, hydrophobicity, and affinity for ligands. In addition, antibodies 
raised against proteins can be conjugated to column matrices and the proteins 
immunopurifled. All of these methods are well known in the art. It will be apparent to 
one of skill that chromatographic techniques can be performed at any scale and using 
equipment from many different manufacturers (e.g., Pharmacia Biotech). 

V. Immunological Detection of CENP-E 

In addition to the detection of CENP-E genes and gene expression using 
nucleic acid hybridization technology, one can also use immunoassays to detect CENP-E. 
Immunoassays can be used to qualitatively or quantitatively analyze CENP-E. A general 
overview of the applicable technology can be found in Harlow & Lane, Antibodies: A 
Laboratory Manual (1988). 



A. Antibodies to CENP-E 

Methods of producing polyclonal and monoclonal antibodies that react 
specifically with CENP-E are known to those of skill in the art (see, e.g. , Coligan, 
Current Protocols in Immunology (1991); Harlow & Lane, supra; Goding, Monoclonal 
Antibodies: Principles and Practice (2d ed. 1986); and Kohler & Milstein, Nature, 
256:495-497 (1975). Such techniques include antibody preparation by selection of 
antibodies from libraries of recombinant antibodies in phage or similar vectors, as well as 
preparation of polyclonal and monoclonal antibodies by immunizing rabbits or mice (see, 
e.g., Huse et al., Science 246:1275-1281 (1989); Ward et al, Nature 341:544-546 
(1989)). 

A number of CENP-E comprising immunogens may be used to produce 
antibodies specifically reactive with CENP-E. For example, recombinant XCENP-E or a 
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antigenic fragment thereof such as the motor or tail domain, is isolated as described 
herein. Recombinant protein can be expressed in eukaryotic or prokaryotic cells as 
described above, and purified as generally described above. Recombinant protein is the 
preferred immunogen for the production of monoclonal or polyclonal antibodies. 
Alternatively, a synthetic peptide derived from the sequences disclosed herein and 
conjugated to a carrier protein can be used an immunogen. Naturally occurring protein 
may also be used either in pure or impure form. The product is then injected into an 
animal capable of producing antibodies. Either monoclonal or polyclonal antibodies may 
be generated, for subsequent use in immunoassays to measure the protein. 

Methods of production of polyclonal antibodies are known to those of skill 
in the art. An inbred strain of mice or rabbits is immunized with the protein using a 
standard adjuvant, such as Freund's adjuvant, and a standard immunization protocol. The 
animal's immune response to the immunogen preparation is monitored by taking test 
bleeds and determining the titer of reactivity to CENP-E. When appropriately high titers 
of antibody to the immunogen are obtained, blood is collected from the animal and 
antisera are prepared. Further fractionation of the antisera to enrich for antibodies 
reactive to the protein can be done if desired (see Harlow & Lane, supra). 

Monoclonal antibodies may be obtained by various techniques familiar to 
those skilled in the art. Briefly, spleen cells from an animal immunized with a desired 
antigen are immortalized, commonly by fusion with a myeloma cell (see Kohler & 
Milstein, Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of immortalization 
include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single immortalized cells are 
screened for production of antibodies of the desired specificity and affinity for the 
antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced 
by various techniques, including injection into the peritoneal cavity of a vertebrate host. 
Alternatively, one may isolate DNA sequences which encode a monoclonal antibody or a 
binding fragment thereof by screening a DNA library from human B cells according to 
the general protocol outlined by Huse et al, Science 246:1275-1281 (1989). 

Monoclonal antibodies and polyclonal sera are collected and titered against 
the immunogen protein in an immunoassay, for example, a solid phase immunoassay with 
the immunogen immobilized on a solid support. Polyclonal antisera with a titer of 10 4 or 
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greater are selected and tested for their cross reactivity against non-CENP-E proteins or 
even other homologous proteins from other organisms, using a competitive binding 
immunoassay. Specific polyclonal aniiseia auu monoclonal antibodies will usually bind 
with a K D of at least about 0.1 mM, more usually at least about 1 /iM, preferably at 

5 least about 0.1 /*M or better, and most preferably, 0.01 /xM or better. 

Once CENP-E specific antibodies are available, CENP-E can be detected 
by a variety of immunoassay methods. For a review of immunological and immunoassay 
procedures, see Basic and Clinical Immunology (Stites & Terr eds., 7th ed. 1991). 
Moreover, the immunoassays of the present invention can be performed in any of several 

10 configurations, which are reviewed extensively in Enzyme Immunoassay (Maggio, ed., 
1980); and Harlow & Lane, supra. 

B. Immunological Binding Assays 

As explained above, CENP-E expression is associated with mitosis. Thus, 

15 CENP-E provides a marker with which to examine actively dividing cells, including 

pathological cells such as cancers or hyperplasias. In a preferred embodiment, CENP-E 
is detected and/or quantified using any of a number of well recognized immunological 
binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110; 4,517,288; and 
4,837,168). For a review of the general immunoassays, see also Methods in Cell Biology 

20 Volume 37: Antibodies in Cell Biology (Asai, ed. 1993); Basic and Clinical Immunology 
(Stites & Terr, eds., 7th ed. 1991). Immunological binding assays (or immunoassays) 
typically utilize a "capture agent" to specifically bind to and often immobilize the 
analyte (in this case the CENP-E or antigenic subsequence thereof). The capture agent is 
a moiety that specifically binds to the analyte. The antibody (anti-CENP-E) may be 

25 produced by any of a number of means well known to those of skill in the art and as 
described above. 

Immunoassays also often utilize a labeling agent to specifically bind to and 
label the binding complex formed by the capture agent and the analyte. The labeling 
agent may itself be one of the moieties comprising the antibody/analyte complex. Thus, 
30 the labeling agent may be a labeled CENP-E polypeptide or a labeled anti-CENP-E 
antibody. Alternatively, the labeling agent may be a third moiety, such as another 
antibody, that specifically binds to the antibody/CENP-E complex. 
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In a preferred embodiment, the labeling agent is a second CENP-E bearing 
a label. Alternatively, the second antibody may lack a label, but it may, in turn, be 
bound by a labeled third antibody specific to antibodies of the species from which the 
second antibody is derived. The second can be modified with a detectable moiety, such 
5 as biotin, to which a third labeled molecule can specifically bind, such as enzyme-labeled 
streptavidin. 

Other proteins capable of specifically binding immunoglobulin constant 
regions, such as protein A or protein G may also be used as the label agent. These 
proteins are normal constituents of the cell walls of streptococcal bacteria. They exhibit 

10 a strong non-immunogenic reactivity with immunoglobulin constant regions from a 

variety of species (see generally Kronval, et aL, J. Immunol, 111: 1401-1406 (1973); 
Akerstrom, et aL, 7. Immunol., 135: 2589-2542 (1985)). 

Throughout the assays, incubation and/or washing steps may be required 
after each combination of reagents. Incubation steps can vary from about 5 seconds to 

15 several hours, preferably from about 5 minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, analyte, volume of solution, 
concentrations, and the like. Usually, the assays will be carried out at ambient 
temperature, although they can be conducted over a range of temperatures, such as 10°C 
to 40°C. 

20 

Non-Competitive Assay Formats 

Immunoassays for detecting CENP-E in samples may be either competitive 
or noncompetitive. Noncompetitive immunoassays are assays in which the amount of 
captured analyte (in this case the protein) is directly measured. In one preferred 

25 "sandwich" assay, for example, the capture agent (anti-CENP-E antibodies) can be bound 
directly to a solid substrate on which they are immobilized. These immobilized 
antibodies then capture CENP-E present in the test sample. CENP-E is thus immobilized 
is then bound by a labeling agent, such as a second CENP-E antibody bearing a label. 
Alternatively, the second antibody may lack a label, but it may, in turn, be bound by a 

30 labeled third antibody specific to antibodies of the species from which the second 
antibody is derived. The second can be modified with a detectable moiety, such as 
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biotin, to which a third labeled molecule can specifically bind, such as enzyme- labeled 
streptavidin. 

Com petitive assay formats 

In competitive assays, the amount of CENP-E (analyte) present in the 
sample is measured indirectly by measuring the amount of an added (exogenous) analyte 
(i.e the CENP-E) displaced (or competed away) from a capture agent (anti-CENP-E 
antibody) by the analyte present in the sample. In one competitive assay, a known 
amount of, in this case, the CENP-E is added to the sample and the sample is then 
contacted with a capture agent, in this case an antibody that specifically binds to the 
CENP-E. The amount of CENP-E bound to the antibody is inversely proportional to the 
concentration of CENP-E present in the sample. In a particularly preferred embodiment, 
the antibody is immobilized on a solid substrate. The amount of the CENP-E bound to 
the antibody may be determined either by measuring the amount of CENP-E present in 
an CENP-E/antibody complex, or alternatively by measuring the amount of remaining 
uncomplexed protein. The amount of CENP-E may be detected by providing a labeled 
CENP-E molecule. 

A hapten inhibition assay is another preferred competitive assay. In this 
assay a known analyte, in this case CENP-E, is immobilized on a solid substrate. A 
known amount of anti-CENP-E antibody is added to the sample, and the sample is then 
contacted with the immobilized CENP-E. The amount of anti-CENP-E antibody bound 
to the immobilized CENP-E is inversely proportional to the amount of CENP-E present 
in the sample. Again, the amount of immobilized antibody may be detected by detecting 
either the immobilized fraction of antibody or the fraction of the antibody that remains in 
solution. Detection may be direct where the antibody is labeled or indirect by the 
subsequent addition of a labeled moiety that specifically binds to the antibody as 
described above. 

Immunoassays in the competitive binding format can be used for 
crossreactivity determinations. For example, a protein partially encoded by SEQ ID 
NO: 1 can be immobilized to a solid support. Proteins are added to the assay that 
compete with the binding of the antisera to the immobilized antigen. The ability of the 
above proteins to compete with the binding of the antisera to the immobilized protein is 
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compared to CENP-E encoded by SEQ ID NO:l. The percent crossreactivity for the 
above proteins is calculated, using standard calculations. Those antisera with less than 
10% crossreactivity with each of the proteins listed above are selected and pooled. The 
cross-reacting antibodies are optionally removed from the pooled antisera by 
immunoabsorption with the considered proteins, e.g., distantly related homologues. 

The immunoabsorbed and pooled antisera are then used in a competitive 
binding immunoassay as described above to compare a second protein, thought to be 
perhaps the protein of this invention, to the immunogen protein (i.e., CENP-E of SEQ ID 
NO: 1). In order to make this comparison, the two proteins are each assayed at a wide 
range of concentrations and the amount of each protein required to inhibit 50% of the 
binding of the antisera to the immobilized protein is determined. If the amount of the 
second protein required to inhibit 50% of binding is less than 10 times the amount of the 
protein partially encoded by SEQ ID NO:l that is required to inhibit 50% of binding, 
then the second protein is said to specifically bind to the polyclonal antibodies generated 
to an CENP-E immunogen. 

Other assay formats 

Western blot (immunoblot) analysis is used to detect and quantify the 
presence of CENP-E in the sample. The technique generally comprises separating 
sample proteins by gel electrophoresis on the basis of molecular weight, transferring the 
separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon 
filter, or derivatized nylon filter), and incubating the sample with the antibodies that 
specifically bind the CENP-E. The anti-CENP-E antibodies specifically bind to the 
CENP-E on the solid support. These antibodies may be directly labeled or alternatively 
may be subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse 
antibodies) that specifically bind to the anti-CENP-E antibodies. 

Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals are then detected according to standard 
techniques (see Monroe et al., Amer. Clin. Prod. Rev. 5:34-41 (1986)). 
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Reduction of non-specific binding 

One of skill in the art will appreciate that it is often desirable to minimize 

Particularly where the assav involves an antigen 
or antibody immobilized on a solid substrate it is desirable to minimize the amount of 
non-specific binding to the substrate. Means of reducing such non-specific binding are 
well known to those of skill in the art. Typically, this technique involves coating the 
substrate with a proteinaceous composition. In particular, protein compositions such as 
bovine serum albumin (BSA), nonfat powdered milk, and gelatin are widely used with 
powdered milk being most preferred. 

Labels 

The particular label or detectable group used in the assay is not a critical 
aspect of the invention, as long as it does not significantly interfere with the specific 
binding of the antibody used in the assay. The detectable group can be any material 
having a detectable physical or chemical property. Such detectable labels have been 
well-developed in the field of immunoassays and, in general, most any label useful in 
such methods can be applied to the present invention. Thus, a label is any composition 
detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, 
optical or chemical means. Useful labels in the present invention include magnetic beads 
(e.g., Dynabeads™), fluorescent dyes (e.g., fluorescein isothiocyanate, Texas red, 
rhodamine, and the like), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, or 32 P), enzymes (e.g., 
horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), 
and colorimetric labels such as colloidal gold or colored glass or plastic (e.g., 
polystyrene, polypropylene, latex, etc.) beads. 

The label may be coupled directly or indirectly to the desired component 
of the assay according to methods well known in the art. As indicated above, a wide 
variety of labels may be used, with the choice of label depending on sensitivity required, 
ease of conjugation with the compound, stability requirements, available instrumentation, 

and disposal provisions. 

Non-radioactive labels are often attached by indirect means. Generally, a 
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds 
to an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable or 
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covalently bound to a signal system, such as a detectable enzyme, a fluorescent 
compound, or a chemi luminescent compound. A number of ligands and anti-ligands can 
be used in conjunction with the labeled, naturally occurring anti-ligands. Alternatively, 
any haptenic or antigenic compound can be used in combination with an antibody. 

5 The molecules can also be conjugated directly to signal generating 

compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of interest as 
labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, 
or oxidotases, particularly peroxidases. Fluorescent compounds include fluorescein and 
its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc. 

10 Chemi luminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, e.g., 
luminol. For a review of various labeling or signal producing systems which may be 
used, see U.S. Patent No. 4,391,904. 

Means of detecting labels are well known to those of skill in the art. 
Thus, for example, where the label is a radioactive label, means for detection include a 

15 scintillation counter or photographic film as in autoradiography. Where the label is a 
fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
wavelength of light and detecting the resulting fluorescence. The fluorescence may be 
detected visually, by means of photographic film, by the use of electronic detectors such 
as charge coupled devices (CCDs) or photomultipliers and the like. Similarly, enzymatic 

20 labels may be detected by providing the appropriate substrates for the enzyme and 
detecting the resulting reaction product. Finally simple colorimetric labels may be 
detected simply by observing the color associated with the label. Thus, in various 
dipstick assays, conjugated gold often appears pink, while various conjugated beads 
appear the color of the bead. 

25 Some assay formats do not require the use of labeled components. For 

instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need be labeled and the presence of 
the target antibody is detected by simple visual inspection. 



30 
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VI. Assays for modulators of CENP-E 

CENP-E is a plus end-directed microtubule motor that is required for 

• . «-ni- _ * : ±1 „..:J„r. *Ua flwnt tima kir»lrvrr ir»r> 11 1/ IPtlVP PFMP-F 

The activity of CENP-E can be assessed using a variety of in vitro assays, e.g., 
5 microtubule gliding assays (see Example II) or ATPase assays (Kodama et aL, 7. 

Biochem. 99: 1465-1472 (1986); Stewart et aL, Proc. Nat'lAcad. Sci. USA 90: 5209- 
5213 (1993). Microtubule depolymerization assays can also be used to examine CENP-E 
activity (Lombilto et aL, J. Cell Biol. 128:107-115 (1995)). 

In addition, CENP-E activity can be examined by comparing antibody 
10 depletion of CENP-E or inhibition of CENP-E in vitro using cultured cells or egg 

extracts. Samples that have been depleted or inhibited are compared to control samples 
that are not inhibited/depleted or that have biologically CENP-E added back to the 
sample. Characteristics such as spindle assembly and metaphase arrest are used to 
compare the effect of CENP-E inhibition or depletion. 
15 Such assays can be used to test for the activity of CENP-E isolated from 

endogenous sources or recombinant sources. Furthermore, such assays can be used to 
test for modulators of CENP-E. Because the plus end-directed microtubule motor activity 
of CENP-E is essential for mitosis, inhibition of CENP-E can be used to control cell 
proliferation. 

20 Candidate agents encompass numerous chemical classes, though typically 

they are organic molecules, preferably small organic compounds having a molecular 
weight of more than 100 and less than about 2,500 daltons. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, or ionic (electrostatic) interactions and typically include at least an amine, 

25 carbonyl, hydroxyl, sulfonyl or carboxyl group, preferably at least two of the functional 
chemical groups. The candidate agents often comprise cyclical carbon or heterocyclic 
structures and/or aromatic or polyaromatic structures substituted with one or more of the 
above functional groups. Candidate agents are also found among biomolecules including 
peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 

30 analogs or combinations thereof. Candidate agents are obtained from a wide variety of 
sources including libraries of synthetic or natural compounds. For example, numerous 
means are available for random and directed synthesis of a wide variety of organic 
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compounds and biomolecules, including expression of randomized oligonucleotides. 

Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and 

animal extracts are available or readily produced. Additionally, natural or synthetically 

produced libraries and compounds are readily modified through conventional chemical, 
5 physical and biochemical means. Known pharmacological agents may be subjected to 

directed or random chemical modifications, such as acylation, alkylation, esterification, 

amidification to produce structural analogs. 

In an embodiment provided herein, the candidate bioactive agents are 

proteins. The protein may be made up of naturally occurring amino acids and peptide 
10 bonds, or synthetic peptidomimetic structures. For example, homo-phenylalanine, 

citrulline and noreleucine are considered amino acids for the purposes of the invention. 

"Amino acid" also includes imino acid residues such as proline and hydroxyproline. 

The side chains may be in either the (R) or the (S) configuration. In the preferred 

embodiment, the amino acids are in the (S) or L-configuration. If non-naturally 
15 occurring side chains are used, non-amino acid substituents may be used, for example to 

prevent or retard in vivo degradations. 

In another embodiment, the candidate bioactive agents are naturally 

occurring proteins or fragments of naturally occurring proteins. Thus, for example, 

cellular extracts containing proteins, or random or directed digests of proteinaceous 
20 cellular extracts, may be used. In one embodiment, the libraries are of bacterial, fungal, 

viral, and mammalian proteins, with the latter being preferred, and human proteins being 

especially preferred. 

In one embodiment, the candidate agents are peptides of from about 5 to 
about 30 amino acids, with from about 5 to about 20 amino acids being preferred, and 
25 from about 7 to about 15 being particularly preferred. The peptides may be digests of 
naturally occurring proteins as is outlined above, or random peptides. By randomized or 
grammatical equivalents herein is meant that each nucleic acid and peptide consists of 
essentially random nucleotides and amino acids, respectively. Since generally these 
random peptides (or nucleic acids, discussed below) are chemically synthesized, they may 
incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all 
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or most of the possible combinations over the length of the sequence, thus forming a 
library of randomized candidate bioactive proteinaceous agents. 

in uuc cuiuuuiuiciu, uic ivviaiy la tuny laiiuuiinz-ou, wim uu a^uwiiwv 

preferences or constants at any position. In a preferred embodiment, the library is 
5 biased. That is, some positions within the sequence are either held constant, or are 

selected from a limited number of possibilities. For example, in a preferred embodiment, 
the nucleotides or amino acid residues are randomized within a defined class, for 
example, of hydrophobic amino acids, hydrophilic residues, sterically biased (either small 
or large) residues, towards the creation of cysteines, for cross-linking, prolines for SH-3 
10 domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. 

In another embodiment, the candidate agents are nucleic acids. By nucleic 
acid or "oligonucleotide" or grammatical equivalents herein means at least two 
nucleotides covalently linked together. A nucleic acid of the present invention will 

15 generally contain phosphodiester bonds, although in some cases, as outlined below, 
nucleic acid analogs are included that may have alternate backbones, comprising, for 
example, phosphoramide (Beaucage et al, Tetrahedron 49(10): 1925 (1993) and 
references therein; Letsinger, 7. Org, Chem. 35:3800 (1970); Sprinzl et al, Eur. J. 
Biochem. 81:579 (1977); Letsinger et al, Nucl. Acids Res. 14:3487 (1986); Sawai et al, 

20 Chem. Lett. 805 (1984), Letsinger et al, J. Am. Chem. Soc. 110:4470 (1988); and 

Pauwels et ai, Chemica Scripta 26:141 91986)), phosphorothioate (Mag et ai, Nucleic 
Acids Res. 19:1437 (1991); and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et 
al, J. Am. Chem. Soc. 111:2321 (1989), O-methylphophoroamidite linkages (see 
Eckstein, Oligonucleotides and Analogues: A Practical Approach, Oxford University 

25 Press), and peptide nucleic acid backbones and linkages (see Egholm, J. Am. Chem. Soc. 
114:1895 (1992); Meier etal, Chem. Int. Ed. Engl. 31:1008 (1992); Nielsen, Nature, 
365:566 (1993); Carlsson et ai, Nature 380:207 (1996), all of which are incorporated by 
reference). Other analog nucleic acids include those with positive backbones (Denpcy et 
ai, Proc. Natl. Acad. Sci. USA 92:6097 (1995); non-ionic backbones (U.S. Patent Nos. 

30 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; Kiedrowshi et ai, Angew. 
Chem. Intl. Ed. English 30:423 (1991); Letsinger etal, J. Am. Chem. Soc. 110:4470 
(1988); Letsinger et al, Nucleoside & Nucleotide 13:1597 (1994); Chapters 2 and 3, 
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ASC Symposium Series 580, Carbohydrate Modifications in Antisense Research, Ed. 
Y.S. Sanghui and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal Chem. Lett. 
4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 37:743 
(1996)) and non-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, 
Carbohydrate Modifications in Antisense Research, Ed. Y.S. Sanghui and P. Dan Cook. 
Nucleic acids containing one or more carbocyclic sugars are also included within the 
definition of nucleic acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp. 169-176). 
Several nucleic acid analogs are described in Rawls, C & E News June 2, 1997 page 35. 
All of these references are hereby expressly incorporated by reference. These 
modifications of the ribose-phosphate backbone may be done to facilitate the addition of 
additional moieties such as labels, or to increase the stability and half-life of such 
molecules in physiological environments. In addition, mixtures of naturally occurring 
nucleic acids and analogs can be made. Alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 
The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. The nucleic acid may be 
DNA, both genomic and cDNA, RNA or a hybrid, where the nucleic acid contains any 
combination of deoxyribo-and ribo-nucleotides, and any combination of bases, including 
uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine, 
isocytosine, isoguanine, etc. 

As described above generally for proteins, nucleic acid candidate agents 
may be naturally occurring nucleic acids, random nucleic acids, or biased random nucleic 
acids. For example, digests of procaryotic or eukaryotic genomes may be used as is 
outlined above for proteins. 

In a preferred embodiment, the candidate bioactive agents are organic 
chemical moieties, a wide variety of which are available in the literature. 

In a preferred embodiment, the candidate agent is a small molecule. The 
small molecule is preferably 4 kilodaltons (kd) or less. In another embodiment, the 
compound is less than 3 kd, 2kd or lkd. In another embodiment the compound is less 
than 800 daltons (D), 500 D, 300 D or 200 D. Alternatively, the small molecule is about 
75 D to 100 D, or alternatively, 100 D to about 200 D. 
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The modulators that are identified herein may be useful as lead compounds 
for therapeutics, bioagricultural compounds, or diagnostics. A therapeutic as used herein 

r . J ...U-.-L. :. U»i:~. .~A k n M^xkla nf mrtWnloti nor PPMP-F in VtVO 
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which can have treatment application in both human and animal disease. Modulation of 

5 CENP-E would be desirable in a number of conditions including but not limited to: 
abnormal stimulation of endothelial cells (e.g., atherosclerosis), solid and hematopoetic 
tumors and tumor metastasis, benign tumors, for example, hemangiomas, acoustic 
neuromas, neurofibromas, vascular malfunctions, abnormal wound healing, inflammatory 
and immune disorders such as Rheumatoid Arthritis, Bechet's disease, gout or gouty 

10 arthritis, abnormal angiogenesis accompanying: rheumatoid arthritis, psoriasis, diabetic 
retinopathy, and other ocular angiogenic diseases such as, macular degeneration, corneal 
graft rejection, corneal overgrowth, glaucoma, Osier Webber syndrome, cardiovascular 
diseases such as hypertension, cardiac ischemia and systolic and diastolic dysfunction and 
fungal diseases such as aspergillosis, candidiasis and topical fungal diseases such as tinea 

15 pedis. 

A bioagricultural compound as used herein refers to a chemical or 
biological compound that has utility in agriculture and functions to foster food or fiber 
crop protection or yield improvement. For example, one such compound may serve as a 
herbicide to selectively control weeds, as a fungicide to control the spreading of plant 
20 diseases, as an insecticide to ward off and destroy insect and mite pests. In addition, one 
such compound may demonstrate utility in seed treatment to improve the growth 
environment of a germinating seed, seedling or young plant as a plant regulator or 
activator. 

A diagnostic as used herein is a compound that assists in the identification and 
25 characterization of a health or disease state in humans or other animals. The diagnostic 
can be used in standard assays as is known in the art. 

The modulators can be applied to generally any cell type wherein CENP-E 
modulation is desired, i.e., eukaryotic, single celled and multicelled organisms, plant and 
animal, vertebrate, invertebrate and mammalian. 
30 Modulators of CENP-E are tested using biologically active CENP-E, 

preferably biologically active XCENP-E. Modulation is tested using one of the in vitro 
assays described above, e.g., ATPase, microtubule binding and/or gliding, spindle 
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assembly, and metaphase arrest. It is understood that any of the assays can be repeated 
or different types of assays can be used on the same candidate agent to further 
characterize the candidate agent as a CENP-E modulator. In particular, where more than 
one candidate agent is used, the assay can be repeated using individual candidate agents. 
Moreover, where a candidate agent is a lead compound, further assays can be performed 
to optimize results until it is established whether that compound or one similar thereto has 
the desired effect. 

As described above, CENP-E is also a useful diagnostic tool in vitro for 
determining when a cell is entering mitosis. Reversible inhibitors of CENP-E can be 
used to synchronize cells in culture. Fungal homologues of XCENP-E also provide a 
diagnostic tool for identifying fungal infections. 

The present invention also provides for kits for screening for modulators of 
CENP-E. Such kits can be prepared from readily available materials and reagents. For 
example, such kits can comprise any one or more of the following materials: biologically 
active CENP-E, reaction tubes, and instructions for testing CENP-E activity. Preferably, 
the kit contains biologically active XCENP-E. A wide variety of kits and components 
can be prepared according to the present invention, depending upon the intended user of 
the kit and the particular needs of the user. For example, the kit can be tailored for 
ATPase assays or microtubule gliding assays. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way 
of illustration and example for purposes of clarity of understanding, it will be readily 
apparent to one of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit 
or scope of the appended claims. 



EXAMPLES 

The following examples are provided by way of illustration only and not 
by way of limitation. Those of skill in the art will readily recognize a variety of 



WO 99/13061 



PCT/US98/1923I 



44 

noncritical parameters that could be changed or modified to yield essentially similar 
results. 



Methods 

5 A. Isolation ofXCENP-E cDNA and DNA constructs 

Fragments spanning nucleotides 1-1707 and 6376-8080 of human CENP-E 
cDNA (Yen, et al., Nature 359:536-539 (1992)) were used to screen a XgtlO adult 
Xenopus ovary cDNA library (Rebagliati, et aL f Cell 42:169-711 (1985)), hybridizing at 
42°C according to Church & Gilbert (Church, et aL, Proc. Natl. Acad. Sci. USA 

10 81:1991-1995 (1984)). cDNA clones hybridizing to both probes were isolated and used 
in combination to isolate overlapping cDNA clones spanning the intervening region. The 
sequence of both cDNA strands was determined by a combination of automated cycle 
sequencing (Applied Biosystems, Perkin Elmer) and manual sequencing using Sequenase 
version 2.0 (USB). 

15 Overlapping regions of the various cDNA clones were often not absolutely 

identical, but displayed single base differences at multiple positions. One clone encoding 
the N-terminal region of the rod domain contained 27 additional nucleotides relative to 
one other clone spanning that region. Single base differences between cDNA clones 
were attributed to polymorphisms present in the outbred psuedotetraploid source material 

20 used to construct the cDNA library. The 27 nucleotide relative insertion may be a 
polymorphism, or may represent an alternatively spliced XCENP-E isoform. 
Overlapping sequence was compiled using MacVector software (Kodak Scientific Imaging 
Systems, Rochester, NY). 

25 B. Expression and purification of XCENP-E motor domain 

Recombinant CENP-E was prepared in order to test its activity in a 
microtubule gliding assay. The recombinant XCENP-E was prepared as a fusion protein 
inducible by IPTG, with a c-myc epitope tag. 

First, an XCENP-E was cloned into an expression plasmid. The 5 T 
30 untranslated region of XCENP-E was removed by PCR using two primers: SEQ ID NO:9 
CATATGACCATGGCCGAGGGAGATGCAG and SEQ ID NO: 10 
GTCAGGTCAGCAACATACACG. These primers were used to amplify the 5' end of 
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the X-CENP-E cDNA and introduce Ndel and Ncol sites adjacent to and at the start 
codon, respectively. This PCR product was subcloned into pCRII (InVitrogen) and then 
joined at the Nrul site with a portion of the XCENP-E cDNA, to reconstruct a motor 
domain encoding cDNA with an altered start codon. 

An Ncol-Xhol fragment spanning nucleotides 143-1939 was excised from 
the reconstructed cDNA fragment and ligated into Ncol/Xhol cut pET23d (Novagen) to 
yield pET23dXCE. This plasmid was digested with BsrGI and Xhol, blunted with 
Klenow, and a Klenow blunted 60 bp HincII-EcoRI fragment from pBSKS + myc (gift 
from S. Michealis) was ligated to the digested pET23dXCE backbone in the presence of 
BsrGI to bias orientation of the insert. The resulting plasmid, pET23dXCEMycHis, 
encodes amino acids 1-473 of XCENP-E linked at the C-terminus to the following 
sequence: SEQ ID NO: 11 TVSISLGDLTME^KLISEEDLNFEHHHHHH. The c-myc 

epitope is underlined. 

This plasmid was transformed into E. coli strain BL21 (DE3) pLysS. A 
culture inoculated with a single colony was gown at 37°C in a modified LB medium (10 
g bactotryptone, 5 g yeast extract, 5 g NaCl, 2 g MgS0 4 , 1 g casaminoacids per liter, 
and 200 mg ampicillin per liter) to OD 600 of -1. The cultures were allowed to cool to 
room temperature and expression of fusion protein was induced with 0.5 mM IPTG at 
room temperature. After induction, the cells were used immediately to prepare fusion 
protein. Cells that were pelleted and stored at low temperatures prior to protein isolation 
gave low to no yield of active protein, due to CENP-E sensitivity to denaturation. 

Cells were harvested 4 hours after induction, immediately resuspended in 
lysis buffer (50 mM Tris-HCl pH 7.5, 50 mM NaCl, 1 mM PMSF, 0.1 mM ATP), and 
lysed by 3 passages through a trench press. Insoluble debris was removed by a 
centrifugation at 35,000 rpm in Sorvall T647.5 rotor for 40 minutes at 4°C. Soluble 
protein in the supernatant was bound in batch to 0.5 ml of NTI-agarose resin (Qiagen) 
for 15 minutes at 4°C. The resin was placed in a column, washed with 5 ml of the lysis 
buffer supplemented with 20 mM Imidazole. XCENP-E fusion protein recovered by 
elution in lysis buffer containing 100 mM imidazole and 1 mM DTT. A typical yield 
was ~2 mg of soluble XCENP-E protein from 1 liter of bacterial culture. Freshly 
prepared protein was used to assay motility. Incubation of bacterially expressed 
XCENP-E motor protein for longer than 24 hours at 4°C led to loss of motility. 
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C. Fusion protein expression, antibody production, and immunoblotting 

The tail and rod regions (see Figure 1) of XCENP-E were used to make 
ontik^ipc tn PFMP.F Antio^TK for ✓v-YPRNP-F^ * „ faa 2396-2954^ and a-XCENP- 
Erod 826-1106) were produced in E. coli strain BL21 (DE3) pLysS as hexahistidine 

5 fusion proteins using the pRSETB expression plasmid (InVitrogen). Following induction 
with IFTG for 4-16 hours, bacteria were pelleted, washed and lysed by rapid freeze thaw 
followed by sonication. Inclusion bodies containing the fusion proteins were purified and 
solubilized in 8M urea, 0.1 M sodium phosphate pH 8.0. 

a-XCENP-E R0D fusion protein was further purified over Ni-NTA agarose 

10 (Qiagen) according to the manufacturer's instructions. a-XCENP-E TADL protein was 

isolated from preparative SDS-PAGE gels as described in (Harlow, et al, Antibodies, A 
Laboratory Manual: Cold Spring Harbor Laboratory) (1988)). These antigens were used 
to raise polyclonal antibodies in rabbits. 

For affinity purification, antigen was coupled to cyanogen bromide 

15 activated Sepharose (Pharmacia) according to the manufacturers instructions. Antibodies 
were purified (Harlow, et al, Antibodies, A Laboratory Manual (1988)), eluting with 0.2 
M glycine pH 2.5. Antibodies were dialyzed into 10 mM K-HEPES pH 7.8, 100 mM 
KC1, 1 mM MgCl 2 and concentrated using prerinsed centricon spin concentrators 
(Amicon, Beverly, MA) or Nanospin filter concentrators (Gelman Sciences, Ann Arbor, 

20 MI). 

For immunoblots, cytoplasmic extract prepared from metaphase H arrested 
Xenopus eggs (Murray, in Methods in Cell Biology, pp. 581-605 (Kay & Peng, eds., 
(1991)) was resolved on a 4% polyacrylamide gel (-50 /xg/lane), transferred to 
nitrocellulose and lanes individually probed with affinity purified a-XCENP-E TAIL or a- 
25 XCENP-E R0D . 

For localization of XCENP-E in cultured Xenopus XTC cells, 
asynchronous cultures of XTC cells were fixed in methanol and simultaneously stained 
with mouse monoclonal anti-a-tubulin antibody and affinity purified rabbit 
a-XCENP-E TAIL antibody. Chromatin was visualized by staining with DAPI. Selected 
30 cells at progressive stages of the cell cycle were examined on the blot: interphase, 
prophase, prometaphase, metaphase, anaphase, and telophase. Similar staining was 
observed using (x-XCENP-E R0D antibody. 
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Immunoblots were prepared as follows: proteins resolved by SDS-PAGE, 
transferred to nitrocellulose, blocked with TBS 5% nonfat dried milk (NFDM), and 
probed with 2 iig/ml affinity purified antibody overnight in TBS containing 0.05 % Tween 
(TBST) containing 5% NFDM. Primary antibody was visualized using 125 I-Protein A 
(Amersham) followed by autoradiography. Occasionally, instead of 125 I-protein A, 
alkaline phosphatase conjugated goat anti-rabbit secondary antibody (Promega) was used 
according to the manufacturers instructions. Quantitative phosphoimaging was performed 
using a Molecular Dynamics model 445 SI phophorimager. 

D. Spindle Assembly in vitro 

CSF-arrested extract is an Xenopus egg extract that is arrested in 
metaphase using cytostatic factor. CSF-arrested extract was prepared from Xenopus eggs 
essentially as described in Murray, in Methods in Cell Biology, pp. 581-605 (Kay & 
Peng, eds. 1991); Sawin, et al., J. Cell Biol. 112:925-940 (1991)). 10 mg/ml rhodamine 
labelled bovine brain tubulin (Hyman, et al, in Methods in Enzymology, pp. 478-485( 
Vallee, ed., 1992)) was added at a 1 /il/300 /xl of extract. 

Localization of XCENP-E was examined on mitotic spindles assembled in 
vitro. Tubulin, DAPI-stained chromatin, and a-XCENP-E TAIL staining was examined. 
Metaphase spindles were assembled in vitro by cycling CSF-arrested Xenopus egg extract 
containing Xenopus sperm chromatin through interphase and arresting at the following 
metaphase as described (Sawin, et al., J. Cell Biol. 112:925-940 (1991)). Rhodamine 
labelled tubulin was added to the extracts to visualize tubulin containing structures. 
Spindles were sedimented onto coverslips and stained with affinity purified 
a-XCENP-ET AIL antibody, followed by FITC-conjugated secondary antibody and DAPI. 

For immunodepletion of extract, 100 /xg of affinity purified a-XCENP-E 
antibody or non-immune rabbit IgG (Calbiochem, San Diego, CA) was bound to 30 ^1 
slurry of protein A Affiprep beads (BioRad, Hercules, CA) for 1 hour at 4°C in CSF-XB 
(Murray, 1991). Beads were sedimented, unbound antibody removed, and serially 
washed with CSF-XB, CSF-XB containing 0.5 M NaCl, and CSF-XB containing 
leupeptin, pepstatin A, and chymostatic (10 /xg/ml each), and cytochalasin B 10 tig/ml. 
100 /il of CSF extract was added to the beads and incubated rocking for 1 hour at 4°C. 
After sedimenting beads, depleted extract was removed and stored on ice until use. 



WO 99/13061 



PCT7US98/19231 * 



48 

Demembranated sperm prepared as described (Newmeyer, et aL, in 
Methods in Cell Biology, pp. 607-634 (Kay & Peng, eds. (1991)) were added to a portion 

/ml or»H pvit from m^tanha^p arrest induced at room 
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temperature by addition of CaCl 2 to 0.6-0.8 mM final concentration. Extracts were 
periodically monitored by fluorescence microscopic examination of 1 fi\ aliquots squashed 
under a coverslip (Murray, in Methods in Cell Biology, pp. 581-605 (Kay & Peng, eds., 
1991)). At 80 minutes following exit from metaphase one half volume of the appropriate 
extract was added and the reaction incubated for an additional 80-120 minutes. 

M-phase structures accumulating in extracts were scored at 160-200 
minutes total elapsed time. Both mock depleted and XCENP-E depleted extracts 
frequently failed to exit interphase, or failed to remain arrested at the second metaphase, 
probably as a consequence of experimental manipulation. Immunoprecipitates were 
washed 3 times with CSF-XB containing protease inhibitors and 0.1 % Triton X-100 and 
examined by SDS-PAGE and Coomassie staining. 

For Coomassie staining and a-XCENP-E R0D blot of a-XCENP-E 
immunoprecipitates, immunoprecipitates were prepared from CSF-arrested extract (~ 10 
mg total protein) using affinity purified a-XCENP-E TAIL antibody, affinity purified 
(a-XCENP-E R0D antibody, or non-immune rabbit IgG. Immunoprecipitates were gently 
washed three times with TBS containing 0.1% Triton-XlOO. 80% of each precipitate was 
resolved by SDS-PAGE on a 5-15% gel and proteins visualized by staining with 
Coomassie brilliant blue. 

For antibody addition experiments, purified anti-XCENP-E antibody or 
non-immune rabbit IgG (Calbiochem) at 10 mg/ml was added to CSF-arrested extract at a 
1:20 dilution, followed by demembranated sperm nuclei and CaCl 2 . 80 minutes later, 
when a half volume of CSF arrested extract was added, a proportional amount of the 
appropriate antibody was added as well. 

Representative structures formed in the presence of 0.5 mg/ml rabbit IgG 
and in the absence of added antibody, and in the presence of 0.5 mg/ml a-XCENP-E TAIL 
antibody were examined. Rabbit IgG and a-XCENP-Er^ (both at 10 mg/ml) were 
added to CSF-arrested metaphase Xenopus egg extract at a 1:20 dilution along with 
Xenopus sperm. Extracts were then cycled through interphase. At 80 minutes into 
interphase (prophase) a half volume of metaphase arrested extract containing 0.5 mg/ml 
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of the appropriate antibody was added. 80 minutes later structures were scored and 

images collected. 

Quantitation of structures formed in extract containing no antibody 
(n=138), extract containing 0.5 mg/ml non-immune rabbit IgG (n = 132), and extract 
containing 0.5 mg/ml a-XCENP-E^ (n=114) at 80 minutes after exit from interphase 
were examined. Structures present in the respective extracts were examined and scored 
as belonging to one of four categories: bipolar spindles with chromatin aligned at the 
metaphase plate; bipolar spindles with misaligned chromosomes; monopolar spindles, 
including radial asters, half spindles and chromosomes associated with microtubules with 
indeterminant organization; and or other, including multipolar structures and groups of 
chromosomes apparently unassociated with microtubules. 

E. Immunofluorescence Microscopy 

Extract containing mitotic spindles assembled in vitro was diluted 30-50 
fold in BRB80 (80 mM KPIPES, 6 mM MgCl 2 , 1 mM EGTA) containing 0.5% Triton 
X-100 and 30% glycerol. Spindles were sedimented at room temperature onto a 
coverslip through a 3 ml cushion of BRB80 containing 0.5% Triton X-100 and 40% 
glycerol at 7000 rpm in a Sorvall HS4 rotor. Coverslips were fixed in -20°C methanol, 
rehydrated in TBS-Tx (150 mM NaCl, 20 mM Tris pH 7.6, 0.1% Triton X-100), 
blocked for 1 hour with 1 % BSA in TBS-Tx and probed with 5 ng/m\ affinity purified 
antibody in 1% BSA in TBS-Tx. After washing with TBS-Tx, primary antibody was 
visualized using by FITC-conjugated secondary goat anti-rabbit antibody (Cappel). 

Xenopus XTC cells cultured on coverslips in 60% L15 medium containing 
10% fetal calf serum at room temperature in ambient atmosphere were rinsed in TBS, 
fixed in -20°C methanol and stained with affinity purified antibody as described above. 
Monoclonal anti-alpha tubulin antibody DM1 A (Sigma) was used at a dilution of 1:1000 

to stain microtubules. 

Fluorescent images were collected using a Princeton Instruments cooled 
CCD mounted on a Zeiss Axioplan microscope controlled by Metamorph software 
(Universal Imaging, West Chester, PA). Image processing was performed using both 
Metamorph and Adobe Photoshop software. 
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F. Preparation of polarity marked microtubules and motility assay 

Taxol stabilized microtubule seeds brightly labelled with rhodamine were 
prepared by incubating a !:! ratio of rhodamine labelled bovine brain tubulin (Byman ( pt 
aL, In Methods in Enzymology, pp. 478-485 (Vallee, ed., 1992)) with unlabelled bovine 
brain tubulin at a final tubulin concentration of 2.5 mg/ml in PEM80 (80 mM Pipes pH 
6.9, 1 mM EGTA, 1 mM MgCl 2 ) containing 10% glycerol, 1/xM taxol, 1 mM GTP at 
37°C for 15 minutes. This mixture was then diluted with 2.75 volumes of warm PEM80 
containing 20 fiM taxol and 2 mM GTP, and sheared by 5 passes through a Hamilton 
syringe. 

Dimly rhodamine-labelled extensions were grown from the brightly 
labelled seeds in PEM80 containing 1 mM GTP and 1.5 mg/ml tubulin cocktail consisting 
of a mixture of N-ethyl maleimide modified tubulin (Hyman, et al. f in Methods in 
Enzymology, pp. 478-485 (Vallee, ed., 1992)), unlabelled tubulin and rhodamine labelled 
tubulin at a ratio of 0.1/0.52/0.38 for 30 minutes at 37°C. The resulting suspension of 
polarity marked microtubules was diluted with PEM80 containing 10 \M taxol and used 
to test motility. 

25 n\ flow chambers prepared from cover slips sealed with an Apiezon 
grease, were preadsorbed with a 1:10 diluted mouse ascities fluid containing anti-myc 
monoclonal antibody 9EI0 (Evans, et al, Mol. Cell Biol. 5:3610-3616 (1985)), washed 
with 50 ix\ PEM80, incubated with XCENP-E motor protein diluted to 0.1 mg/ml, and 
unbound protein removed by rinsing with 50 /il of PEM80. A microtubule/ ATP mix 
consisting polarity marked microtubules in PEM80 containing 10 fiM taxol, 2 mM 
MgATP, and an oxygen scavenging system (0.1 mg/ml catalase, 0.03 mg/ml glucose 
oxidase, 10 mM glucose, 0.1% (3-mercaptoethanol (Kishino, et al, Nature 33:74-76 
(1989)) was then flowed into the chamber. 

Movement of microtubules was monitored at room temperature on a Zeiss 
Axioplan fluorescence microscope fitted with 63X Plan-Apochromat oil immersion 
objective, and a Princeton instruments cooled CCD. Automated time-lapse image 
acquisition and data analysis was performed using the MetaMorph software package 
(Universal Imaging, West Chester, PA). 
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Rxample I: Identification of Xenoous CENP-E 

To investigate the role of CENP-E in mitotic spindle formation in vitro 
using extracts of Xenopus eggs used low stringency hybridization followed by library 
rescreening was used to clone the Xenopus homologue of CENP-E. This clone was 
subsequently used to raise antibodies suitable for immunodepletion and antibody addition 
studies. The nucleotide sequence (SEQ ID NO:2) encodes a protein of 2954 amino acids 
with a predicted molecular mass of 340 kDa (SEQ ID NO:l, Figure 1C). The predicted 
structure of Xenopus CENP-E (XCENP-E) is similar to human CENP-E (hCENP-E), 
consisting of a 500 amino acid globular ami no-terminal domain containing a kinesin-like 
microtubule motor domain linked to a globular tail domain by a region predicted to form 
a long, discontinuous a-helical coiled coil (Lupas, et al, Science 252, 1162-1164 (1991); 
Berger, et al., Proc. Natl. Acad. Sci. USA 92:8259-8263 (1995)) (Figure 1A). Within 
the core of the motor domain (residues 1-324) XCENP-E and hCENP-E share 74% 
identity, significantly greater than that shared between XCENP-E and its nearest 
phylogenetic (evolutionary) neighbors (Moore, et al., Bioessays 18:207-219 (19%)). 
Outside the amino-terminal domain lie three additional regions which share greater than 
25% identity with human CENP-E, but not with other kinesin-like proteins (Figure 1). 
On the basis of these regions of identity and its large predicted size, the conclusion was 
made that XCENP-E is the Xenopus homologue of human CENP-E. 

Example II: XCENP-E is a Plus End-D irected Microtubule Motor 

Both human and Xenopus CENP-E are localized to the centromeres of 
mitotic chromosomes throughout all phases of chromosome movement. This localization 
places CENP-E in a position to mediate attachment of chromosomes to microtubules, 
movement of chromosomes during congression, and movement of chromosomes toward 
the spindle poles during anaphase A. 

To test directly if CENP-E is a microtubule motor and to determine the 
directionality of CENP-E movement, the amino-terminal 473 amino acids of XCENP-E, 
containing the kinesin-like motor domain, was fused at the C-terminus to 31 amino acids 
containing an 9 amino acid c-myc epitope tag followed by a hexahistidine tag (see Figure 
2A, and Figure 1C, amino-terminal boxed region). 
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This protein was produced in E. coli, and purified over nickel-agarose, 
yielding the expected 57 kDa polypeptide as the major product (Figure 2A, lane 1). 

t — ...:*u ~ mAnA/»1/Minl intihr\Hw /'QPIfYi /'Fvanc fit nl Mnl Cell 

lllliiiuiiiiuuiuuuig wiui a ci-iiijrv uiwuwiwiitti \^ *-*~/ \— . — — -» ~- — > 

fl/o/. 5:3610-3616 (1985)) confirmed the 57 kDa protein as the XCENP-E fusion protein 
(Figure 2B, lane 2, arrowheads). 

The XCENP-E fusion protein was tethered to a glass coverslip using the 
a-myc antibody and gliding of polarity marked microtubules containing brightly 
fluorescent rhodamine labelled seeds near their minus ends (Howard, et al, in Motility 
Assays for Motor Proteins, pp. 105-113 (Scholey, ed., 1993)) was recorded by time-lapse 
digital fluorescence microscopy. Representative time points demonstrating three 
examples of plus end-directed movement are presented in Figure 2B. Microtubules 
moved at a velocity of 5.1 /xM/min±1.7 (n=49) with brightly fluorescent seeds leading, 
indicating that the immobilized XCENP-E fusion protein was moving toward microtubule 
plus ends. No movement was observed in the absence of fusion protein. When assayed 
in the absence of a-myc antibody the XCENP-E fusion protein also supported 
microtubule gliding, albeit less robustly. 

This experiment demonstrates that CENP-E has plus-ended microtubule 
motor activity. Furthermore, by perturbing CENP-E function in Xenopus egg extracts, as 
shown below in Examples III-V, it was shown that congression in vitro requires a 
kinetochore-associated microtubule motor. This result contrasts with a prevailing model 
describing mitotic spindle formation in Xenopus egg extracts in vitro (Vernos, et al, 
Trends in Cell Biol. 5:297-301 (1995); Heald, et al, Nature 382, 420-5 (1996); Hyman, 
et al., Cell 84:401-410 (1996)). Both bipolar spindles with misaligned chromosomes and 
monopolar structures were observed when XCENP-E, a kinetochore-specific protein, was 
removed, or when XCENP-E function is impaired by addition of a-XCENP-E antibody 
(see Examples III-V below). These findings indicate that during normal mitotic spindle 
formation, CENP-E plays an essential role in mitotic spindle assembly and in 
prometaphase chromosome movements that result in metaphase chromosome alignment, 
via its activity as a plus-end directed microtubule motor activity. 
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Fxam ple III: XCENP-E associates with Xenopus c entromeres in vivo and in vitro 

To verify that Xenopus CENP-E exhibits a cell cycle-dependent 
kinetochore association, polyclonal antibodies were raised against two recombinant 
antigens, one spanning the tail and C-terminal portion of the rod (<*-XCENP-E TAIL , 
Figure IB) and the other corresponding to a portion of the N-terminus of the rod domain 
(a-XCENP-E ROD ; Figure IB). 

Immunoblotting of Xenopus egg extract reveals that the a-XCENP-E TArL 
antibody specifically recognizes XCENP-E as a single band of greater than 300 kDa. 
The a-XCENP-E ROD antibody specifically recognizes XCENP-E and another protein of 
slightly lower molecular weight that may be a distinct isoform of XCENP-E lacking the 
tail domain, or XCENP-E that has lost its tail domain as a result of partial proteolysis. 

Immunostaining of cultured Xenopus XTC cells using a-XCENP-E TAIL 
antibody revealed patterns of cell cycle-dependent localization similar to that observed for 
mammalian CENP-E (Yen, et al, Nature 359:536-539 (1992); Brown, et al., J. Cell Sci. 
109:961-969 (1996)) with the exception that during interphase XCENP-E was localized to 
the nucleus, consistent with the presence of a nuclear localization signal (Boulikas, et al.. 
Gene Express. 3:193-227 1993)) at the C-terminal end of the rod domain (Figure 1A, 
NLS, and 1C underlined sequence, RKKTK). Nuclear staining intensity was variable 
from cell to cell, probably reflecting different levels of XCENP-E accumulation, as 
observed for cytoplasmic CENP-E staining of interphase human cells (Yen, et al, Nature 
359:536-539 (1992); Brown, et al., J. Cell. Biol. 125:1303:1312 (1994)). 

Early in prometaphase XCENP-E localizes to discrete spots associated with 
condensed mitotic chromosomes. During metaphase and early anaphase, XCENP-E 
remains in discrete foci on chromosomes, and is also apparent at the spindle poles. 
XCENP-E is found at the spindle midzone during late anaphase and telophase. 
a-XCENP-EjA^ immunostaining of metaphase spindles assembled using cytostatic factor 
(CSF)-arrested Xenopus egg extracts cycled through interphase (Murray, in Methods in 
Cell Biology, pp. 581-605 (Kay & Peng, eds. 1991); Sawin, et al., J. Cell Biol. 
112:925-940 (1991)) revealed that XCENP-E was also associated with kinetochores 
assembled in vitro. Similar patterns of staining were observed in XTC cells and on 
spindles assembled in vitro using a-XCENP-E R0D antibody. 
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Example IV: XCENP-E is Required for Congression 

To determine the aspect(s) of mitosis for which XCENP-E is required, a- 
yprmp-P antihnHv (mid* tn the. tail domain of XCENP-EO was used to deolete 

-*.v^*-* »* ~ i ALL — — j \ -~ • 

XCENP-E from Xenopus egg extracts arrested in metaphase (CSF-extract). 

5 Immunoblotting of control and XCENP-E depleted CSF-extracts revealed that greater 
than 95% of XCENP-E could be removed by immunodepletion with this antibody. 
Unrelated antigens, such as XNuMA (Merdes, et ai f Cell 87:447-458 (1996)), were 
unaffected by depletion of XCENP-E . 

To examine the effects of immunodepletion on spindle assembly and 

10 chromosome movement, demembranated Xenopus sperm nuclei were added to 

undepleted, mock depleted and XCENP-E depleted CSF-extracts. Extracts were released 
from CSF-imposed metaphase arrest by addition of calcium and allowed to cycle through 
interphase and into the subsequent M-phase, whereupon an additional aliquot of the 
appropriate uncycled, metaphase-arrested XCENP-E depleted, mock depleted or 

15 undepleted extract was added to re-impose a metaphase arrest, thus allowing the 
accumulation of M-phase structures. 

While mock depleted and undepleted extracts yielded predominantly 
bipolar spindles with chromosomes aligned at the metaphase plate, depletion of 
XCENP-E produced a five-fold increase in the number of bipolar spindles with 

20 misaligned chromosomes, as well as smaller increase in the percentage of monopolar 
structures, including radial asters, half spindles, and chromosomes associated with 
microtubules with indeterminate organization. Extended incubation failed to alter the 
proportion of bipolar spindles with properly aligned chromosomes. This finding, and the 
presence of chromosomes resembling non-disjoined metaphase sister chromatids on 

25 structures formed in the absence of XCENP-E indicates that depletion of XCENP-E 

prevents congression of chromosomes to the metaphase plate despite apparently normal 
spindle assembly and chromosome attachment. 

Three independent experiments revealed in every case a decrease in the 
percentage of metaphase spindles accompanied by an increased percentage of 

30 bipolar/misaligned and monopolar structures, although the distribution of the aberrant 

structures between the monopolar and bipolar/misaligned classes was variable. Failure of 
XCENP-E depletion to totally prevent the appearance of bipolar spindles with properly 



WO 99/13061 



FCT/US98/I923I 



55 

aligned chromosomes could be due to residual XCENP-E (below detection limit), may 
reflect the actions of other motor proteins functioning in partial redundancy with 
XCENP-E, or may simply reflect that proportion of spindles in which the chromosomes 
were already sufficiently aligned. These data indicate that XCENP-E, or a complex 
containing XCENP-E, is required for chromosome congression. 

To test the possibility that a multiprotein complex had been removed, the 
proteins immunodepleted by a-XCENP-E TArL were compared with those precipitated by 
a-XCENP-E ROD antibody. Examination of the proteins immunodepleted by 
a-XCENP-Ej AIL revealed the presence of multiple bands. This result was not surprising, 
given that XCENP-E is relatively low in abundance compared to other spindle proteins 
such as NuMA (8-14 M g/ml (Merdes, et al., Cell 87:447-458 (1996)), XKCM1 (10 
/zg/ml), and Xklp2 (16 M g/ml, (Boleti, et al., J. Cell. Biol. 125:1303-1312). 

Immunoprecipitates prepared with <*-XCENP-E ROD antibody also contained 
multiple proteins, only two of which were obviously held in common with the a-XCENP- 
Et AIL immunoprecipitate. Immunoblotting of a-XCENP-E TAIL and a-XCENP-E ROD 
immunoprecipitates with a-XCENP-E R0D antibody revealed that one of the proteins is 
XCENP-E, and that the other protein of slightly lower molecular weight, is the additional 
XCENP-E related protein shown earlier to be recognized in unmanipulated extract by the 
a-XCENP-E ROD antibody. The presence of this XCENP-E-related protein in 
immunoprecipitates prepared using the a-XCENP-Er^ antibody, which does not 
directly recognize this lower molecular weight species, provides evidence that like most 
kinesin-related proteins, XCENP-E exists in a complex that is at least dimeric. 

Example V: Addition of q-XCENP-E Antib o dy Disrupts Metanhase Spindle Formation 

As a further test of the requirement of CENP-E in mediating chromosome 
congression, especially in view of the removal of multiple proteins upon immunodepletion 
of XCENP-E, XCENP-E function was perturbed in situ by addition of the monospecific 
a-XCENP-E TAIL antibody to CSF-arrested Xenopus egg extracts. These extracts were 
cycled through interphase and arrested at the subsequent M-phase. 

As observed upon immunodepletion of XCENP-E, addition of a-XCENP- 
Et AIL antibody resulted in almost total elimination of bipolar spindles with properly 
aligned chromosomes. This loss was accompanied by an increase in the percentage of 
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bipolar spindles with misaligned chromosomes, indicating a role for XCENP-E in 
congression. Also observed was a large increase in the proportion of monopolar 
structures suggesting an additional role for XCENP-E in establishment or maintenance of 
spindle bipolarity. Similar results were obtained in four independent experiments, and 

5 also using <*-XCENP-E R0D antibody. 

The monopolar structures observed upon addition of a-XCENP-E antibody 
could arise from disruption of bipolar spindle assembly. This sort of spindle perturbation 
has also been observed following overexpression of the p50 subunit of dynactin, which 
also localizes to kinetochores (Echeverri, et aL, J. Cell Biol. 132:617-633 (1996)). p50 

10 overexpression disrupts spindle bipolarity, yielding two apparent monopoles. On the 

other hand, monopoles may also arise from disruption of sister chromatid cohesion upon 
entry into anaphase (Murray, et aL, Proc. Natl. Acad. Sci. USA 93:12327-12332 (1996)). 
Since apparently non-disjoined sister chromatids are visible in structures formed in the 
presence of a-XCENP-E TAIL antibody, the monopolar structures observed for XCENP-E 

15 are unlikely to be the products of premature anaphase. These findings support a role for 
XCENP-E during prometaphase in establishment or maintenance of bipolarity, as well as 
in congression. 

Consistent with an essential role for XCENP-E in chromosome movement, 
chromosomes associated with monopolar structures formed in the presence of anti-- 

20 XCENP-E antibody were often found distributed both at the periphery and within the 
aster of microtubules. In contrast, the small proportion of monopolar structures formed 
in control extracts chromosomes were invariably localized at the periphery of the aster. 
Unlike perturbation of XKCM1, a relatively abundant Xenopus kinesin superfamily 
member, which induces formation of large asters as a consequence of decreased 

25 microtubule catastrophe (Walczak, et al. f Cell 84:3747 (1996)), the asters formed in 
extracts to which a-XCENP-Ej^ antibody was added, or from which XCENP-E had 
been removed, were not unusually large. This observation suggests that XCENP-E does 
not play a role in regulating microtubule dynamics analogous to that played by XKCM 1 . 
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WHAT IS CLAIMED IS : 

1. An isolated, biologically active CENP-E protein, wherein the CENP-E 
protein has the following properties: (i) at least one activity selected from the group 
consisting of plus end-directed microtubule motor activity, ATPase activity, and 
microtubule binding activity; and (ii) the ability to specifically bind to polyclonal 
antibodies generated against CENP-E. 

2. An isolated protein of claim 1, wherein the protein has an amino acid 
sequence having at least 55% sequence identity with a Xenopus CENP-E (XCENP-E) 
motor domain of SEQ ID NO:l. 

3. An isolated protein of claim 1, wherein the protein is encoded by a 
nucleic acid sequence having at least 70% sequence identity to SEQ ID NO:2. 

4. An isolated protein of claim 1, wherein the protein is encoded by a 
nucleic acid that hybridizes under high stringency conditions to a nucleic acid having a 
seuqence complementary to that of SEQ ID NO:2. 

5. An isolated protein of claim 1, wherein the protein has an amino acid 
sequence of SEQ ID NO:l. 

6. An isolated protein of claim 1, wherein the protein is from a human. 

7. An isolated protein of claim 1, wherein the protein specifically binds to 
antibodies generated against Xenopus CENP-E (XCENP-E). 

8. An isolated protein of claim 7, wherein the protein has an amino acid 
sequence having greater than 70% sequence identity with a Xenopus CENP-E (XCENP- 
E) motor domain of SEQ ID NO:l. 
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9. An isolated protein of claim 7, wherein the protein has an amino 
acid sequence of a Xenopus CENP-E (XCENP-E) motor domain of SEQ ID NO:l. 

10. An isolated protein of claim 1, wherein the protein specifically binds 
to antibodies generated against a motor domain of Xenopus CENP-E (XCENP-E). 

11. An isolated protein of claim 1, wherein the protein has an average 
molecular weight of about 300-350 kDa. 

12. An isolated nucleic acid sequence encoding a CENP-E gene product, 
said sequence encoding a protein having a core motor domain that has greater than 70% 
amino acid sequence identity to a Xenopus CENP-E (XCENP-E) core motor domain as 
measured using a sequence comparison algorithm, and specifically binding to antibodies 
raised against CENP-E. 

13. An isolated nucleic acid sequence of claim 12, wherein the sequence 
has a nucleotide sequence of SEQ ID NO:2. 

14. An isolated nucleic acid sequence of claim 12, wherein the 
sequence comparison algorithm is PILEUP. 

15. An isolated nucleic acid sequence of claim 12, wherein the 
sequence encodes a protein having an average molecular weight of about 300-350 kDa. 

16. An antibody which specifically binds to Xenopus CENP-E (XCENP- 

E). 

17. A method for identifying a candidate agent as a compound that 
modulates CENP-E activity, said method comprising the steps of: 

(i) determining CENP-E activity in the presence of a candidate agent at a control 
concentration, wherein the CENP-E activity comprises at least one activity selected from 
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the group consisting of plus end-directed microtubule motor activity, ATPase activity, 
and microtubule binding activity; and 

(ii) determining said CENP-E activity in the presence of the candidate agent at a 
test concentration, wherein a change in activity between the test concentration and the 
control concentration of said candidate agent indicates the identification of a compound 
that modulates CENP-E activity. 

18. A method of claim 17, further comprising the step of isolating 
biologically active CENP-E from a cell sample. 

19. A method of claim 17, wherein the biologically active CENP-E is 
human CENP-E. 

20. A method of claim 17, wherein the biologically active CENP-E is 
Xenopus CENP-E (XCENP-E). 

21. A method of claim 17, wherein the biologically active CENP-E 
comprises a motor domain of Xenopus CENP-E (XCENP-E). 

22. A method of claim 17, wherein the biologically active CENP-E 
comprises an amino acid sequence of a XCENP-E motor domain of SEQ ID NO:l. 

23. A method of claim 17, wherein said activity is plus end-directed 
microtubule motor activity. 

24. A method of claim 17, wherein said change in activity is a decrease. 

25. A method of claim 17, wherein said change in activity is an increase. 

26. A method of claim 17, wherein said compound identified is a lead 
therapeutic compound. 
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27. A method of claim 26, wherein said compound is an antibody. 



28. A method of claim 27, wherein said antibody specifically binds human 

CENP-E. 

29. A method of claim 27, wherein said method further comprises 
modifying said antibody to be a humanized antibody. 

30. A method of claim 17, wherein said compound identified is a lead 
bioagricultural compound. 

31. A method of claim 17, wherein said compound identified is a lead 
diagnostic compound. 

32. A method of claim 31, wherein said compound identified is an 

antibody. 

33. A method of claim 32, wherein said antibody specifically binds to 
human CENP-E. 

34. A method of claim 32, wherein said method further comprises 
modifying said antibody to be a humanized antibody. 

35. A method of claim 17, wherin said method is performed in a plurality 
and wherein said plurality of methods are performed simultaneously. 

36. A kit for screening for modulators of CENP-E, said kit comprising; 

(i) a container holding biologically active CENP-E; and 

(ii) instructions for assaying for CENP-E activity, wherein the CENP-E 
activity is plus end-directed microtubule motor activity or ATPase activity. 
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37. A kit of claim 36, wherein the biologically active CENP-E is Xenopus 
CENP-E (XCENP-E). 

38. A kit of claim 36, wherein the biologically active CENP-E is the 
motor domain of Xenopus CENP-E (XCENP-E). 

39. A method of producing a biologically active CENP-E polypeptide 

comprising: 

(i) transforming a cell with a vector comprising the nucleic acid sequence 
encoding the motor domain of CENP-E; 

(ii) expressing said nucleic acid to produce a gene product; 

(iii) purifying said gene product; and 

(iv) identifying ATPase activity or plus-end directed microtubule activity of said 
gene product. 

40. A method of moving microtubules in a plus ended direction, said 
method comprising contacting microtubules with CENP-E. 

41. A method of claim 40, wherein said method further comprises the step 
of transforming a cell with a vector comprising a gene encoding said CENP-E and 
expressing said gene. 



42. A method of claim 41, wherein said CENP-E affects proliferation of 

said cell. 
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NFSKKAKFSDMPSFPEIDDSVq rEFlSBFDDA^MMDSNGIDA|EWIj]LBsBV 500 

THREKTSLHQSMIDFGQI SDSVQFHDSSKENQLQYLPKDSGDMAECRKAS 550 

FEKEITSLQQQLQSKEEEKKELVQSFELKIAELEEQLSVKAKNLEMVTNS 600 

IREHS INAl EVOTDVEKEWRKEMSVLGDSGYNASNSDLQDSSVDGKRLSSS 650 

HDECIEHRKMLEQKIVDLEEFIENLNKKSENDKQKSSEQDFMESIQLCEA 700 



IMAEKA@ALEi2ALM§ONFgll I0ENiT0KRUl ADlURSEKgNQlrNiFl 7 50 



800 



ij^KEgcJfeHi^^ 
I@L AE0RKR A0N LQKKVRNFlDLS|Vg|MG D SEKLCEE I FOll^SE)SgAgA0T 850 
Rl^OKECS FLRSENLELKF^j EDTSNVQ^iKAASBFEK^lTEKSNYB 900 

[KMtE AlDnc fCELO S AFN E I| N YjpN GPL A^KVP|R^LgS RVjjQ2ll<lElV S EF S KIqIQE 950 

§ALilKN^ENi^CLSEYKF^EYlcP<NQESKAlSEllMLLKQiGEHS 1000 

Ail I SKCjEjl IMQEQSiQ(l]LQLTBEVTH^SlKVQQSEEQ YLEiy@KMiD0LF 1050 

E^IRNKSglAi)Dp J RlMENLKGTMESVEVKIADTKHi2EiTliDKEQLBH 1100 

gKKYFFQAMQTIFPIligSDSLPPSKEkEGNSQDPlillNDYHNBlALATE 1150 

RNNIMVC0ETilNSlLEEQVlg^lTQB2i2QAQSIEKSDB2KPKQDLEEGE 12 00 

VKLLLEMELlffcHLTDSQLSCDEiS^ 12 50 

E|lZ^NFiDLK{^HDS[Lg^ 1300 
FRQQLLDCSVGISSPNHDA0ANQE§VSLGEVNSLOSEMLRGERDELQTSC 1350 
KALVSiLELLPJ\HVK|SlVEGENBEITKKBNGLEiKEILGESEESEVLKSMLE 1400 
NLKE§NNKLKEQAEEYSSKENQFSLEl|VFSGSQKLVCi)lEVLKAQElAAE 1 4 50 
(eJrlE I KD@D YEHLVQSAN TN LVEGEOE TP LQ AD(HEE|) S IDRP^£He[EM|!|I K V 1500 
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0GiKLERNQ YLOE@LQEiKQEBSNiLE I L0E#^ETSVLLkSdD2QKL@sB 1550 
LSENIlEHNlDTfgKHHSDTQ§QLOKTQQELQLAKiLAIAASDNCP|lTQ 1600 
iKlTSADCVHPll)EKIp^EEOiQKT^EKLLH|SNEQEQASlVEQKCi) 1 650 
VEHl!3MKSMIESKSSLESLQHEKHDfl^Q|l^A p^ MQVVTQEKKE03QTH 1700 
EHLTAEVDHP<.ENIELGLNFKPEA^}KTT^E)QC0LNEN§ELE0SQHRLQC 1750 
ilEELMKSESDg^iA2ETLK|lSEQKVINLNQ0ylEMVMI#fflLKNSQRTV 1800 
I A ERDOLP DDLRlElSfVlEMS I@TQDDLRK§QEALOQQKDKVQEp" SQ I SVLQ 1850 
EKISLLlNQMLYNVATVKEIrEsElRDDLNQSKQHLFSEliTOSLSEKSKEF 1900 
ALEQAE0DK AD AARKT I D I TE|l SNpEQ LLQQ ATlLKET^ERE SLIQC 1950 
KEQLAgNTE|LRETl^SKDLALG^!EQERDiAAN0VIALTEKMSSLEEQI 2000 
VTTg^EGiGEKEllF Y LQRP S|^ME)EfLtRE S LgrKDODBEEAEK 2 050 

E I SEATlfO|KiL)TAK I S SgEEgl LQ^AS I LNEAVS^EflLRTHSKQCfflVSl 2100 
LE^LT^igHAFAOSKP^DiA0NllASLAEilKll]LTKEMDEFRDiKl 2150 
ESgQjEQSSHpEELCTYgT|L^lBKQQiE^lSNKLAEgVKi^DE0LQHLS 2200 
SL^EQI^^I^ELRNE^^NY^C^MD(^E^llSV£§LMQNEPQQ^lE0D 22 50 
VAERMD I LE SpQE I QELMEK I SAVYSEQHTjpLS|3S SELQKET@AH0H C 2 3 0 0 
MLNIKESLS^LSRSFGSB2TEHVKgNTQLQTLLNgF§\@RSAAVKgDl 2350 
SLp^DYiKOLAAiQKRHDi^LQLOCBEpHGRKWSBS^lEllKFCEIEFL 2 400 
NELLFKKANllTjQSVQD^FJFjVQVFLNQVGSTLSlI^HKEGFMQWEESFG 2 4 50 
DLH VQAK^SEgMOQiSRElAST I QLLTKiLK@VVQS KIQR@ITVYLNQF 2 500 
iAK02EKilQiKEBMRRMEHHGPiASVMEEESARLLGILKTVQDEiKKLg 2 550 
SR|KMLiiEpLVKDD^HSGElVAILQDKBLSPSAEAELNAMQVELTKK 2 600 

QDNj|3QAAMKE[!lEN[L^ 2 650 

DC^AYL^Cl^DKF^L0R0KiEERgAOADNDiT0CVPKDYQKAiTFiv 27 00 
fTCGGGSGIVQa ilAMEV^l^KAAElP^lgHYigKKYHHBSRTMSSSEDEKg 2750 
^lAKSDAgsiHTGSSHRGSPHElilTYRHGP^lPERSEMBSLHLGEEHKSE 2800 

sst^rvvspnrseiysqlvmspgktghhkhilspskvglhkkralspnrs 2850 
emptqhvispgktglh^iiSestiEdnlsSpOE)2QkvqenlnIHp1gklEO 2 900 

VKElS)>flYCESQFEDNlKE33FSElLNT@ESNDKSQAENW0YEAKKETAEl 2950 

FIG. 1C2. 
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SEQUENCE LISTING 



SEP ID NO : 1 XCENP E PROTEIN: 



MSEGDAVKVCVRVRPLIQREQGDQANLQWKAGNNTISQVDGTKSFNFDRV 

FNS HE S TSQ I YQE I AVP 1 1 RS ALQGYNGT I FAYGQTS SGKTY TMMGTPNS 

LG I I PQAI QEVFKI I QE I PNRE FLLRVS YME I YNETVKDLLCDDRRKKPL 

EIREDFNRNVYVADLTEELVMVPEHVIQWIKKGEKNRHYGETKMNDHSSR 

SHTI; RMIVESRDRNDPTNSENCDGAVMVSHLNLVDLAGSERASQTGAEG 

VRLKEGCNINRSLFILGQVIKKLSDGQAGGFINYRDSKLTRILQNSLGGN 

AKTVI I CT I TP VS FDETLS TLQFAS TAKHVRNTPHVNEVLDDEALLKRYR 

KE I LDLKKQLENLES S SETKAQAMAKEEHTQLLAE I KQLHKEREDR I WHL 

TNIWASSQESQQDQRVKRKRRVTWAPGKIQNSLHASGVSDFDMLSRLPG 

NFSKKAKFSDMPSFPEIDDSVCTEFSDFDDALSMMDSNGIDAEWNLASKV 

THREKTSLHQSMIDFGQISDSVQFHDSSKENQLQYLPKDSGDMAECRKAS 

FEKEITSLQQQLQSKEEEKKELVQSFELKIAELEEQLSVKAKNLEMVTNS 

REHSINAEVQTDVEKEWRKEMSVLGDSGYNASNSDLQDSSVDGKRLSSS 

HDECIEHRKMLEQKIVDLEEFIENLNKKSENDKQKSSEQDFMESIQLCEA 

I MAEKANALEELALMRDNFDNI I LENETLKRE I ADLERS LKENQETNEFE 

ILEKETQKEHEAQLIHEIGSLKKLVENAEMYNQNLEEDLETKTKLLKEQE 

IQLAELRKRADNLQKKVRNFDLSVSMGDSEKLCEEIFQLKQSLSDAEAVT 

RDAQKECSFLRSENLELKEKMEDTSNWYNQKEKAASLFEKQLETEKSNYK 

KMEADLQKELQSAFNEINYLNGLLAGKVPRDLLSRVELEKKVSEFSKQLE 

KALEEKNALENEVTCLSEYKFLPNEVECLKNQISKASEEIMLLKQEGEHS 

AS 1 1 SKQEI IMQEQSEQILQLTDEVTHTQSKVQQTEEQYLEMKKMHDDLF 

EKY I RNKSEAEDLLREMENLKGTMES VEVKI ADTKHELEETI RDKEQLLH 

EKKYFFQAMQTIFPITPLSDSLPPSKLVEGNSQDPIEINDYHNLIALATE 

RNNIMVCLETERNSLKEQVIDLNTQLQSLQAQSIEKSDLQKPKQDLEEGE 

VKLLLEMELLKGHLTDSQLS I EKLQLENLEVTEKLQTLQEEMKN I T I ERN 

ELQTNFEDLKAEHDSLKQDLSENI EQS I ETQDELRAAQEELREQKQLVDS 

FRQQLLDCSVGISSPNHDAVANQEKVSLGEVNSLQSEMLRGERDELQTSC 

KALVSELELLRAHVKSVEGENLEITKKLNGLEKEILGKSEESEVLKSMLE 

NLKEDNNKLKEQAEEYSSKENQFSLEEVFSGSQKLVDEIEVLKAQLKAAE 

ERLEIKDRDYFELVQTANTNLVEGKLETPLQADHEEDSIDRRSEEMEIKV 

LGEKLERNQYLLERLQEEKLELSNKLEILQKEMETSVLLKDDLQQKLESL 

LSENIILKENIDTTLKHHSDTQAQLQKTQQELQLAKNLAIAASDNCPITQ 

EKETSADCVHPLEEKILLLTEELHQKTNEQEKLLHEKNELEQAQVELKCE 
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VEHLMKSMIESKSSLESLQHEKHDTEQQLLALKQQMQWTQEKKELQQTH 
EHLTAEVDHLKENIELGLNFKNEAQQKTTKEQCLLNENKELEQSQHRLQC 
EIEELMKSLKDKESALETLKESEQKVINLNQEMEMVMLEMEELKNSQRTV 
IAERDQLQDDLRESVEMS IETQDDLRKAQEALQQQKDKVQELTSQI SVLQ 
EKISLLENQMLYNVATVKETLSERDDLNQSKQHLFSEIETLSLSLKEKEF 
ALEQAEKDKADAARKTIDITEKISNIEEQLLQQATNLKETLYERESLIQC 
KEQLALNTEHLRETLKS KDLALGKMEQERDEAANKVI ALTEKMS S LEEQ I 
NENVTTLKEGEGEKETFYLQRPSKQQSSSQMEELRESLKTKDLQLEEAEK 

E I SEATNE I KNLTAKI S SLEEE I LQNAS I LNEAVS ERENLRHS KQQLVSE 
LEQLSLTLKSRDHAFAQSKREKDEAVNKIASLAEEIKILTKE^4DEFRDSK 
ESLQEQSSHLSEELCTYKTELQMLKQQKEDINNKLAEKVKEVDELLQHLS 
SLKEQLDQIQMELRNEKLRNYELCEKMDIMEKEISVLRLMQNEPQQEEDD 

VAERMDI LESRNQE I QELMEKI SAVYS EQHTLLSSLS SELQKETEAHKHC 
ML^IKESLSSTLSRSFGSLQTEHVKLNTQLQTLLNKFKVVYRTAAVKEDH 
SLIKDYEKDLAAEQKRHDELRLQLQCLEQHGRKWSDSASEELKFCEIEFL 
NELLFKKANIIQSVQDDFSEVQVFLNQVGSTLQEELEHKKGFMQWLEEFG 
DLHVDAKKLSEGMQQENRRIASTIQLLTKRLKAWQSKIQREITVYLNQF 
EAKLQEKKEQNKELMRRMEHHGPSASVMEEENARLLGILKTVQDESKKLQ 
SRIKMLENELNLVKDDAMHKGEKVAILQDKLLSRNAEAELNAMQVKLTKK 

QDNLQAAMKE I ENLQKMVAKGAVP YKEE I DNLKTKWKI EME KI KYS KAT 
DQEIAYLKSCLEDKEEGLRRLKEELRRAQADNDTTVCVPKDYQKASTFPV 
TCGGGSGIVQSTAMLVLQSEKAALERELSHYKKKYHHLSRTMSSSEDRKK 
TKAKSDAHSSHTGSSHRGSPHKTETYRHGPVTPERSEMPSLHLGSPKKSE 
SSTKRWSPNRSEIYSQLVMSPGKTGMHKHILSPSKVGLHKKRALSPNRS 
EMPTQHVISPGKTGLHKNLTESTLFDNLSSPCKQQKVQENLNSPKGKLFD 
VKSKSMPYCPSQFFDNSKLGDFSELNTAESNDKSQAENWWYEAKKETAPE 

CKTS 

SEP ID NO: 2 XCENP-E NUCLEOTIDE 

GAATTCCGGAGTCGGATAGGCTAGTCGGGCGAGGGAAATTCAAACTGGTT 
ATAGAAGAAACTTGAACCGCCGCCAAAAGGGACTAAAGTGACAGAGACAG 
GGAGCGGTGTCGGTACCGATTTCTCCACTAATCGGTCTCAAAATGTCCGA 
GGGAGATGCAGTTAAAGTGTGTGTGAGGGTTCGGCCGCTTATACAGAGAG 
AACAAGGGGATCAAGCCAACCTGCAATGGAAGGCTGGAAACAACACCATT 
TCCCAAGTTGATGGGACAAAGTCTTTCAATTTCGATCGTGTATTTAATTC 
TCACGAATCAACAAGTCAAATTTACCAAGAAATAGCAGTACCTATCATAC 
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GATCAGCTTTGCAGGGATATAATGGCACAATATTTGCATACGGACAGACA 
TCTTCAGGCAAGACGTACACAATGATGGGAACACCAAATTCATTGGGCAT 
AATAC C C CAAGC C AT ACAGGAAGTTTTTAAAATTATTC AGGAGATAC CGA 
ACAGAGAGTTTCTTCTAAGAGTTTCTTATATGGAGATTTACAATGAAACT 
GTGAAAGACCTACTGTGTGATGACAGAAGAAAGAAGCCCTTGGAAATTCG 
CGAGGATTTTAATAGAAACGTGTATGTTGCTGACCTGACTGAAGAACTTG 
TAATGGTTCCTGAACATGTAATACAGTGGATCAAAAAGGGTGAAAAAAAC 
AGACATTATGGAGAGACTAAAATGAATGATCATAGTAGTCGTTCACATAC 
AATATTTAGAATGATTGTTGAAAGCCGAGACAGAAATGATCCCACAAATT 
CAGAGAACTGTGATGGAGCTGTCATGGTATCTCACTTGAATTTGGTAGAT 
CTTGCTGGCAGTGAAAGAGCAAGCCAAACTGGAGCTGAAGGTGTGAGACT 
TAAGGAAGGCTGCAACATCAACCGCAGCTTGTTTATCCTTGGACAGGTTA 
TTAAGAAGCTTAGCGACGGCCAGGCTGGTGGATTTATAAACTACAGAGAC 
AGCAAACTCACCAGAATTCTCCAAAATTCATTGGGAGGAAATGCTAAAAC 
GGTTATAATTTGCACAATTACGCCAGTTTCTTTTGATGAGACTCTAAGTA 
CACTTCAGTTTGCCAGTACTGCCAAACATGTGAGAAATACTCCCCATGTT 
AATGAGGTCCTGGATGATGAAGCGTTGCTAAAAAGGTACAGAAAGGAAAT 
CTTGGATTTAAAGAAACAATTAGAGAATTTAGAGTCATCGTCTGAAACAA 
AAGCTCAAGCAATGGCTAAAGAAGAGCATACACAGTTGCTAGCTGAAATC 
AAACAACTACACAAAGAGAGAGAAGATAGAATATGGCACTTGACAAATAT 
TGTTGTTGCTTCATCCCAAGAATCTCAACAGGACCAAAGGGTCAAACGAA 
AACGAAGAGTTACGTGGGCGCCAGGAAAAATCCAAAATAGTTTACATGCT 
TCTGGTGTTTCTGACTTTGATATGCTATCCAGATTACCTGGCAATTTTAG 
CAAGAAGGCAAAGTTCTCTGACATGCCTTCATTTCCAGAAATTGATGACT 
CTGTTTGTACAGAGTTTTCTGATTTTGATGACGCCCTCTCCATGATGGAC 
AGCAATGGAATAGATGCAGAATGGAATTTAGCCAGTAAAGTAACTCACAG 
AGAAAAGACATCACTTCATCAATCAATGATAGACTTTGGACAGATTTCTG 
ACAGTGTTCAGTTTCATGATTCTTCTAAGGAAAACCAGCTACAATACCTC 
CCCAAAGACTCTGGTGATATGGCTGAATGCAGAAAAGCTTCTTTTGAAAA 
AGAGATCACAAGCCTCCAGCAACAACTACAGTCAAAGGAGGAAGAAAAAA 
AGGAACTTGTACAAAGCTTCGAGCTCAAGATAGCAGAACTGGAAGAGCAG 
CTTAGTGTCAAAGCTAAAAATCTAGAGATGGTTACAAACTCGAGAGAGCA 
TTCCATTAATGCTGAAGTCCAAACAGATGTTGAAAAGGAAGTTGTGAGAA 
AAGAAATGTCAGTCCTTGGAGACTCTGGTTACAATGCATCAAACAGTGAC 
CTACAGGATAGTTCTGTTGATGGTAAACGTCTAAGCAGCTCCCATGATGA 
GTGTATAGAACACAGAAAAATGCTGGAACAAAAGATCGTTGATTTAGAAG 
AGTTTATTGAAAACCTTAACAAGAAAAGTGAGAATGATAAACAAAAATCT 
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TCTGAGCAAGATTTTATGGAGAGTATTCAGCTATGTGAAGCTATAATGGC 

AGAAAAGGCAAATGCACTGGAGGAACTGGCACTTATGCGAGATAATTTTG 

ACAATATTATTCTAGAGAATGAAACTCTAAAAAGGGAAATTGCAGATCTG 

GAACGTTCACTCAAGGAGAATCAAGAAACCAATGAGTTTGAAATTCTGGA 

GAAGGAAACTCAAAAAGAACACGAGGCACAACTAATCCATGAGATTGGCA 

GTTTAAAGAAATTAGTTGAAAATGCAGAGATGTACAATCAAAATCTTGAG 

GAAGATCTAGAAACTAAAACAAAACTTCTGAAAGAGCAGGAAATTCAACT 

TGCAGAATTAAGGAAACGCGCAGATAACTTGCAGAAAAAAGTACGAAATT 

TTGATCTCTCGGTTTCCATGGGTGATAGTGAGAAACTCTGTGAAGAAATC 

TTTCAACTGAAGCAATCTCTTTCTGATGCTGAAGCTGTGACTCGCGATGC 

TCAGAAGGAATGTTCTTTCCTCAGAAGTGAAAATCTAGAGCTGAAGGAGA 

AAATGGAGGACACATCAAACTGGTACAATCAAAAAGAAAAGGCTGCGTCT 

TTGTTTGAGAAGCAGCTGGAAACTGAAAAATCAAACTACAAGAAAATGGA 

AGCTGATTTGCAGAAAGAGTTGCAAAGTGCTTTTAATGAGATTAACTACT 

TAAATGGCCTTCTGGCAGGAAAGGTCCCCAGAGATTTGCTTTCTCGTGTT 

GAATTAGAGAAAAAGGTTTCTGAGTTCTCAAAGCAGCTTGAGAAAGCATT 

GGAAGAAAAAAATGCCTTGGAGAATGAAGTGACTTGCCTATCAGAATACA 

AATTTTTGCCAAATGAAGTTGAATGCTTGAAAAATCAGATCAGCAAGGCT 

TCTGAAGAGATAATGTTATTAAAGCAAGAAGGAGAACATTCTGCATCTAT 

TATAAGCAAACAAGAGATTATCATGCAGGAGCAATCTGAGCAGATTTTAC 

AACTGACTGACGAAGTGACACACACACAGTCAAAAGTGCAGCAGACTGAA 

GAGCAATACTTGGAGATGAAGAAAATGCATGATGATCTTTTTGAAAAGTA 

TATCAGAAACAAAAGTGAAGCTGAAGACCTTTTAAGAGAAATGGAGAACC 

TTAAAGGCACTATGGAGTCTGTGGAAGTAAAGATTGCTGACACAAAACAT 

GAACTTGAAGAAACTATAAGGGATAAGGAGCAACTGCTTCATGAGAAAAA 

ATACTTTTTTCAAGCAATGCAGACTATATTTCCGATTACACCTCTTTCAG 

ACTCGCTTCCTCCCTCAAAATTAGTTGAAGGGAACTCTCAAGACCCCATA 

GAAATCAATGACTACCACAATTTAATAGCCCTTGCTACAGAAAGGAACAA 

CATTATGGTGTGTCTAGAGACTGAAAGAAACAGTCTCAAGGAGCAAGTTA 

TTGATTTGAACACTCAACTTCAAAGTCTTCAAGCACAAAGTATTGAAAAG 

TCTGATCTCCAGAAGCCAAAGCAAGACTTGGAAGAAGGAGAGGTTAAACT 

GCTTTTGGAGATGGAACTACTGAAGGGACACCTAACTGACTCACAGCTGT 

CTATAGAGAAATTGCAGCTGGAGAATCTGGAAGTTACAGAAAAACTCCAA 

ACACTTCAAGAAGAGATGAAAAATATTACTATAGAAAGGAATGAGCTTCA 

GACCAACTTCGAAGACCTGAAAGCAGAGCATGATAGCCTAAAACAAGACC 

TTAGTGAAAACATTGAGCAGTCAATTGAAACACAAGATGAATTAAGGGCT 

GCCCAGGAAGAGCTAAGAGAACAGAAGCAACTGGTTGATAGCTTTAGACA 
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ACAGCTTTTAGATTGTTCTGTAGGAATTTCATCACCAAACCATGATGCAG 

TTGCTAACCAGGAAAAGGTGTCATTGGGTGAAGTTAATTCGTTACAAAGT 

GAAATGCTGCGTGGTGAAAGAGATGAGCTTCAAACATCTTGTAAGGCATT 

AGTTTCAGAACTGGAGCTACTTCGTGCTCATGTAAAATCTGTGGAGGGAG 

AAAACCTTGAAATCACAAAAAAACTCAATGGCCTTGAAAAGGAGATATTG 

GGCAAATCTGAAGAAAGCGAAGTGTTGAAGTCCATGTTGGAGAATCTAAA 

GGAAGACAACAATAAGCTCAAAGAACAAGCAGAGGAATATTCTAGTAAAG 

AAAATCAATTCAGTTTAGAAGAGGTGTTCAGTGGTTCACAGAAGCTGGTA 

GATGAGATAGAGGTCCTGAAAGCACAGCTAAAGGCAGCAGAAGAAAGGCT 

GGAAATAAAGGATAGAGATTATTTTGAACTTGTACAAACTGCAAACACCA 

ATTTAGTTGAGGGAAAATTGGAAACTCCATTGCAAGCTGACCATGAGGAA 

GACAGCATTGATCGGCGTTCTGAAGAAATGGAGATAAAAGTTCTTGGAGA 

AAAACTTGAGCGAAATCAGTATTTACTGGAAAGATTGCAAGAAGAAAAGC 

TGGAACTGTCTAACAAACTTGAAATCCTTCAGAAAGAGATGGAGACGTCC 

GTTCTATTAAAAGATGACCTGCAACAGAAGCTAGAAAGCTTGCTGAGTGA 

AAACATTATTCTAAAAGAGAATATTGACACAACCCTAAAGCATCATTCAG 

ATACTCAAGCTCAGCTGCAGAAAACACAGCAAGAGCTACAGTTGGCTAAG 

AATCTTGCAATCGCTGCTTCTGACAATTGTCCAATAACTCAAGAAAAGGA 

AACCTCTGCAGATTGTGTGCATCCTCTGGAAGAAAAGATATTGTTATTAA 

CTGAAGAATTGCATCAAAAAACTAATGAACAGGAAAAATTACTACATGAA 

AAGAATGAACTTGAACAAGCTCAGGTTGAGCTAAAGTGCGAGGTGGAACA 

TCTGATGAAGAGTATGATCGAATCGAAGTCCTCACTTGAGTCCTTGCAGC 

ATGAGAAACATGATACTGAACAACAACTTCTTGCTCTTAAACAGCAGATG 

CAAGTAGTTACTCAAGAAAAGAAAGAGCTGCAACAAACCCATGAACACTT 

AACAGCTGAGGTGGACCATCTAAAAGAGAATATTGAATTGGGTTTGAATT 

TTAAAAATGAAGCGCAGCAAAAGACCACTAAAGAGCAATGTCTGCTAAAT 

GAGAATAAAGAACTTGAGCAAAGCCAGCACAGACTTCAATGTGAGATAGA 

AGAGCTTATGAAAAGCTTAAAGGATAAAGAGTCAGCGCTGGAAACTTTAA 

AAGAATCTGAACAAAAAGTAATTAATCTGAACCAAGAAATGGAAATGGTT 

ATGCTGGAAATGGAGGAATTGAAAAATAGCCAGAGGACTGTAATTGCTGA 

GAGGGACCAGCTGCAAGACGACCTAAGGGAGAGTGTTGAAATGTCCATTG 

AAACTCAAGATGATCTAAGAAAGGCTCAAGAAGCATTGCAGCAGCAGAAA 

GATAAAGTTCAGGAACTGACCTCCCAGATTTCTGTCCTGCAGGAAAAGAT 

CTCTCTTCTGGAAAATCAGATGCTTTATAATGTTGCAACTGTGAAAGAAA 

CTCTAAGCGAAAGAGATGACCTGAACCAGTCTAAGCAACACCTGTTCTCA 

GAAATTGAAACTCTTAGTCTGTCTTTAAAAGAAAAGGAATTTGCATTGGA 

ACAAGCAGAGAAGGACAAAGCTGATGCTGCCAGGAAAACAATAGATATCA 



v * WO 99/13061 



PCT/US98/1923I 



6 

CAGAGAAGATATCAAATATAGAAGAACAGTTACTTCAACAAGCCACCAAT 
TTAAAGGAAACTCTTTATGAAAGAGAGAGTCTTATCCAGTGTAAGGAGCA 
ACTGGCTTTGAACACAGAACACCTTAGGGAAACATTGAAGAGCAAAGACT 
TGGCATTGGGTAAAATGGAGCAGGAGAGAGATGAAGCTGCCAATAAAGTA 

ATAG CTCTTACAGAAAAGATGT CTT CT CTAGAAGAAC AGATC AATGAAAA 

TGTTACTACTCTAAAAGAAGGTGAGGGTGAAAAAGAGACCTTCTACCTTC 

AGAGACCTTCAAAGCAGCAGTCTTCTTCCCAGATGGAAGAGCTCAGAGAG 

TCTTTAAAGACCAAAGATTTGCAGTTGGAAGAGGCCGAGAAGGAGATAAG 

TGAAGCTACTAATGAAATAAAGAATCTCACTGCTAAGATCTCTTCTCTAG 

AGGAGGAGATTCTTCAGAATGCTAGCATTTTGAATGAAGCTGTAAGCGAA 

AGGGAAAACCTTCGCCATTCGAAGCAGCAACTGGTTTCAGAATTGGAGCA 

GCTATCACTGACATTAAAGAGTAGAGACCATGCATTTGCGCAATCTAAAC 

GCGAAAAGGATGAAGCTGTAAATAAAATAGCCAGTCTCGCTGAAGAAATA 

AAGATCCTGACAAAAGAGATGGATGAATTCAGAGATTCAAAGGAATCCTT 

GCAAGAACAGTCTTCCCATCTAAGTGAAGAGTTATGTACATATAAGACTG 

AACTTCAAATGCTCAAGCAACAGAAAGAAGACATCAACAACAAACTTGCA 

GAGAAAGTTAAGGAAGTGGATGAGCTATTGCAACACTTATCATCTCTAAA 

GGAACAGCTGGACCAAATACAGATGGAGCTAAGGAATGAAAAGCTCAGAA 

ACTATGAACTCTGCGAAAAGATGGATATCATGGAAAAAGAAATCTCAGTG 

CTGCGTTTAATGCAGAACGAGCCTCAGCAGGAAGAAGATGATGTTGCAGA 

ACGTATGGATATACTTGAGAGCAGAAACCAAGAAATACAGGAGCTGATGG 

AAAAAATCTCCGCTGTGTATTCAGAGCAACACACTTTGCTCAGCAGTCTC 

TCTAGTGAGCTTCAAAAGGAAACTGAAGCACACAAACATTGCATGTTAAA 

TATAAAGGAATCTCTGTCATCCACGCTCTCCAGATCCTTTGGCAGCTTGC 

AAACTGAGCATGTTAAGCTAAATACTCAACTGCAGACCCTTCTGAACAAA 

TTTAAGGTTGTATACCGAACTGCTGCAGTCAAAGAAGATCATAGCTTGAT 

CAAAGATTATGAGAAGGACCTTGCTGCTGAGCAAAAGAGGCATGATGAGC 

TGCGACTCCAACTGCAGTGTTTGGAGCAGCACGGCAGAAAATGGTCGGAT 

TCTGCATCTGAGGAACTCAAGTTCTGTGAAATTGAATTCTTGAATGAGTT 

ACTTTTTAAAAAAGCAAATATAATTCAGAGTGTCCAGGATGACTTTTCAG 

AGGTGCAGGTATTCCTAAATCAAGTAGGATCAACACTGCAAGAAGAGCTT 

GAGCACAAGAAAGGCTTTATGCAGTGGTTGGAGGAATTTGGAGATCTGCA 

CGTCGATGCTAAGAAACTCAGTGAAGGCATGCAACAGGAAAATAGGCGCA 

TTGCTTCTACCATACAGCTCTTAACAAAAAGGCTAAAGGCAGTTGTTCAG 

TCGAAAATACAACGTGAGATAACCGTATATCTGAACCAGTTTGAAGCGAA 

ATTGCAAGAGAAGAAAGAACAAAACAAAGAACTTATGCGCAGAATGGAGC 

ACCACGGCCCTAGTGCTAGTGTAATGGAGGAAGAAAATGCTAGACTTTTA 
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GGCATACTGAAAACTGTTCAAGATGAATCCAAGAAACTCCAATCAAGGAT 
CAAAATGCTAGAAAATGAACTGAACTTGGTCAAAGATGATGCCATGCACA 
AAGGTGAAAAAGTTGCAATTTTGCAAGACAAACTACTAAGCAGAAACGCC 
GAAGCTGAGCTAAACGCAATGCAGGTGAAACTAACTAAAAAGCAAGATAA 
TCTTCAGGCTGCAATGAAAGAAATAGAAAACCTACAGAAAATGGTTGCCA 
AAGGTGCAGTACCATATAAAGAAGAAATTGACAACCTTAAAACTAAGGTG 
GTAAAGATTGAAATGGAAAAAATAAAGTACTCAAAAGCAACAGACCAAGA 
GATTGCCTACTTAAAGTCTTGTTTGGAAGATAAGGAAGAAGGCTTGCGTA 
GGTTAAAAGAGGAACTTAGGCGAGCACAGGCAGACAACGATACAACAGTT 
TGTGTTCCAAAAGATTATCAGAAAGCTTCAACTTTCCCTGTGACTTGTGG 
TGGTGGAAGCGGTATAGTGCAGAGCACAGCAATGCTTGTGCTGCAGTCGG 
AAAAAGCCGCCTTGGAAAGGGAGCTGTCACATTATAAGAAGAAATATCAT 
CATTTATCACGGACTATGTCAAGTTCTGAAGATCGAAAGAAAACAAAGGC 
AAAATCTGATGCTCATTCTTCTCATACTGGATCATCACACAGAGGCTCGC 
CTCACAAAACTGAAACTTACAGACATGGCCCTGTTACTCCAGAAAGGTCT 
GAAATGCCAAGCCTACACCTAGGATCTCCAAAGAAGTCGGAGTCCAGCAC 
TAAACGTGTTGTGTCACCAAACAGGTCCGAAATCTACAGCCAATTAGTAA 
TGTCTCCAGGCAAGACCGGGATGCATAAACATATACTTTCTCCAAGCAAG 
GTTGGACTGCACAAAAAGCGTGCCCTGTCTCCAAACAGATCGGAGATGCC 
CACCCAGCATGTCATATCCCCTGGCAAGACCGGACTGCATAAAAATCTAA 
CTGAAAGCACGTTATTCGACAATTTGTCTTCTCCATGCAAACAGCAAAAA 
GTACAGGAAAATCTAAATTCCCCTAAAGGCAAATTATTTGATGTGAAGTC 
AAAATCGATGCCTTACTGTCCATCTCAGTTTTTTGATAATTCTAAGCTTG 
GTGATTTTTCAGAGCTCAACACAGCAGAGAGCAATGACAAAAGTCAGGCT 
GAGAACTGGTGGTATGAAGCAAAAAAAGAAACGGCACCTGAATGTAAAAC 
ATCCTAGATCCCTGTACATCTGACTCTCCTGTCCTGCAAAGAGACTTGCT 
ACTCTGCCTTCTTGTAGGAAGAAACACTAGAAACTGCCATGTCTGCATAA 
AGGAGTCTCACTGGAAGCAAAAGTTGTTCTTTAGTAGTAATCACTGGTTG 
GGCGAGTGGTTACGTCTTTAAAATAAAGTGCAATACGTCTCACGTCTATT 
TTATATGTTATGTCTGTGTATTTGTTACACTTTTAAGTCCCTTGACTTCA 
TATTTGGCTCATCTGTAGTTTCTTTGTGTTTGCGCACACACACACTGGTG 
AAAAATGACATTTGCAGTGTATTGTTTTACTGACTGGTCTCTCTGGGGCC 
ATCCATGCAAAGCACCATTAGTGTGCCAATGTTTTTCACTACTTATTATT 
ATGTCTGACTTTGTGAAATAGAAAATACTACAAAAGATAGGGCAAAAGTT 
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GTTTTCTCACTAGGTAAACGATATGGGTTTAACTTAATTATTCTCCTAAT 
AAAATATTCTATATAATGGCATCTCATTAGGTGACCTAATGGTTATTCTG 
CACTCTTGTAAACCTTTTTTAAATTTCACTCGTAATAAAGCAGCCCTGAT 

TTTAAATTAAAAAAAAAACGGAATTC 
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